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Description 

Tgghpical R^W 

5 [0001 ] The present invention relates to new isolates of the viral class Hepatitis C. polypeptides, polynucleotides and 
antibodies derived therefrom, as well as the use of such polypeptides, polynudeotides and antibodies in assays (e.g., 
immunoassays, nucleic add hybridization assays, eta) and in the production of viral polypeptides. 

10 

[0002] Non-A, Non-B hepatitis (NANBIH) is a transmissible disease or family of diseases that are believed to be viral- 
induced, and that are distinguishable from other forms of viral-assodated liver diseases, including that caused by the 
known hepatitis viruses, i.e.. hepatitis A virus (HAV), hepatitis B virus (HBV). and delta hepatitis vims (HDV). as well as 
the hepatitis induced by cytomegalovims (CMV) or Epstein-Barr vims (EBV). NANBH was first identified in transfused 

IS individuals. Transmission from man to chimpanzee and serial passage in chimpanzees provided evidence tinat NANBH 
is due to a transmissible infectious agent or agents. Epidemiologic evidence is suggestive tiiat there may be three types 
off NANBH: the water-borne epidemic type; the blood or needle associated type; and ttie sporadically occun-ing (com- 
munity acquired) type. Howa^er, until recentiy, no transmissible agent responsible for NANBH had not been Identified. 
[0003] ainical diagnosis and identification of NANBH has been accomplished primarily by exdusion of other viral 

20 mari«rs. Among ttie methods used to detect putative NANBH antigens and antibodies are agar^el diffusion, counter- 
immunoeledrophoresis, immunofluorescence microscopy, immune electron miaoscopy. radioimmunoassay, and 
enzyme-iinl«d immunosortDent assay. However, none of ttiese assays has proved to be sufffidentiy sensitive, specific, 
and reproducible to be used as a diagnostic test for NANBH. 

[0004] Until recentiy tiiere has been neither darity nor agreement as to ttie identity or specificity of ttie antigen anti- 
25 body systems associated wrth agents of NANBH. It is possible ttiat NANBH is caused by more ttian one infectious agent 
and undear what the serological assays detect in ttie semm of patients witti NANBH. 

[0005] In the past a number off candidate NANBH agents were postulated. See, ag.. Prince (1983) Ann. Rev. Micro- 
bid. aZ:217; Feinstone & Hoofnagle (1984) New Eng. J. Med. 311:185; Overby (1985) Cun-. Heptol. 5:49; Overby 
(1986) Cunr. Hepld. S:65; Overby (1987) Curr. Heptol. 2:35; and Iwarson (1987) British Med. J. 2^:946. However, ttiere 

30 is no proof ttiat any of ttiese candidates represent ttie etidogical agent off NANBH. 

[0006] In 1987. Houghton et al. doned ttie first virus definitively linked to NANBH. See, e.g.. EPO Pub. Na 318,216; 
Houghton et al., Sdence 244:359 (1989). Houghton et al. described ttierein the cloning of an isolate from a new viral 
dass, hepatitis C vims (HCV). ttie prototype isolate descrtoed ttierein being named "HCVr. HCV is a Flavi-like virus, 
witti an RNA genome. Houghton et al. desaibed the production of recombinant proteins from HCV sequences that are 

35 useful as diagnostic reagents, as well as polynudeotides useful in diagnostic hybridization assays and in ttie doning of 
additional HCV isolates. 

[0007] The demand for s^isitive, specific mettiods for screening and identiffying earners off NANBH and NANBH con- 
taminated Uood or blood products is significant Post-transfusion hepatitis (PTH) occurs in approximately 10% off trans- 
fused patients, and NANBH accounts for up to 90% off ttiese cases. TTiere is a frequent progression to chronic liver 
40 damage (25-55%). 

[0008] Patient care as well as ttie prevention of transmission of NANBH by blood and Wood products or by dose per- 
sonal contact require reliable diagnostic and prognostic tods to detect nudeic adds, antigens and antibodes related to 
NANBH. In addition, there is also a need for effective vaccines and immunottierapeutic ttierapeutic agents for the pre- 
vention and/or treatment of tiie disease. 
45 [0009] While at least one HCV isolate has been identified which is useful in meeting the above needs, additional iso- 
lates, particulariy ttiose witti divergent a genome, may prove to have unique applications. 

Summarv of ttie Invention 

50 [0010] New isolates of HCV has been characterized from Japanese Wood donors who have been implicated as 
NANBH carriers. TTiese isolates exhibit nudeotide and amino acid sequence heterogeneity witti respect to ttie proto- 
type isdate, HCV1, in several viral domains. It is believed that these distinct sequences are of in importance, particu- 
lariy in diagnostic assays and in vaccine development. 

[0011] In one embodiment, the present invention provides a polynucleotide in substantially isolated form conprising 
55 a nudeotide sequence of at least 8 nudeotides from a J-1 HCV isolate, said J-1 HCV isolate having at least 90% nude- 
otide sequence homology witti tiie J-1 sequence of any one of Figures 7 to 10 or 13 to 18. wherein said nucleotide 
sequence of at least 8 nudeotides is distinct from the nudeotide sequence of HCV isdate HCV-1. 
[0012] In anottier embodiment d ttie present invention provides a purified polypeptide comprising a amino add 
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sequaK:e which: 

(a) is encoded by a nucleotide sequence of the invention, said coding being in frame with the con-esponding amino 
add sequences set out in Rgures 7 to 10 or 13 to 18; 

(b) comprises an antigenic detenrtinant; and 

(c) contains at least one amino add substitution compared to the amino acids encoded by the corresponding por- 
tion of the HCV-1 sequence. 

[00131 Yet another embodiment of the present Invention provides a polypeptide of the invention immobilized on the 
solid support 

[0014] In a further embodiment of the present invention, an imnrujnoassay for detecting the presence of anti-HCV anti- 
bodies in a test sample is provided comprising: (a) incubating the test sample under conditions ttiat allow the formation 
of antigen-antibody complexes with polypeptide of the invention, wherein the polypeptide is not imnmjnologically cross- 
reactive with HCV-1 and (b) detecting any antigen-anttedy complexes formed. 

[001 5] Yet another embodiment of ttie present invention provides a method of detecting HCV polynudeotides In a test 
sample comprising: (a) providing a polypeptide of the invention as a probe; (b) contacting the test sanple and ttie probe 
under conditions that allow for the formation of a polynudeotide duplex between the probe and its conplement in ttie 
absence of sitetantial polynudeotide duplex formation between the probe and non-HCV pdynudeotide sequences 
present in the test sample; and (c) detecting any polynudeotide duplexes comprising ttie probe. 
[0016] These and other embodiments of ttie present invention will be readily apparent to ttiis of ordinary skill in ttie 
art in view of the following desaiption. 

Brief description of the Hgures 

[0017] 

Rgure 1 shows ttie consensus sequence of ttie coding strand of a fragment from the J7 G/E domain witti ttie het- 
erogeneities. 

Figure 2 shows the consensus sequence of ttie coding strand of a fragment from the J1 E domain witti ttie hetero- 
geneities. 

Rgure 3 shows ttie consensus sequence of ttie coding strand of a fragment of ttie J1 E/NS1 domain witti ttie het- 
erogeneities. 

Rgure 4 shows ttie consensus sequence of ttie coding strand of a fragment from ttie J1 NS3 domain witti ttie het- 
erogenertiea 

Rgure 5 shows ttie consensus sequence of ttie coding strand of a fragment from ttie J1 NS5 domain witti ttie het- 
erogeneities. 

Rgure 6 shows tfie homology of ttie J7 C/E consensus sequence witti tiie nudeolide sequence of ttie same domain 
fromHCVI. 

Rgure 7 shows ttie homology of ttie J1 E consensus sequence witti ttie nudeotide sequence of the same domain 
fromHCVI. 

Rgure 8 shows ttie homology of ttie J 1 E/NS1 consensus sequence witti ttie nudeotide sequence of ttie same 
domain fromHCVI. 

Rgure 9 shows ttie homology of ttie J 1 NS3 consensus sequence witti ttie nudeotide sequence of ttie same 
domain from HCV1. 

Rgure 10 shows ttie homology of ttie J1 NS5 consensus sequence witti ttie nudeotide sequence of ttie same 
domain from HCV1. 

Rgure 1 1 shows ttie putative genomic organization of ttie HCV1 genome. 

Rgure 12 shows ttie nudeotide sequence of ttie ORF of HCV1. In ttie figure nudeotide number 1 is ttie first A of 
ttie putative initiating mettiionine of ttie large ORF; nudeotides upstream of ttiis nudeotide are numbered witti neg- 
ative numbers. 

Rgure 13 shows ttie consensus sequence of ttie cocfing strand of a fragment from ttie J1 NS1 domain (J1 1519) 
witti ttie nudeotide sequence of ttie same domain from HCV1 . Also shown are ttie amino adds encoded ttierein. 
Rgure 14 shows a composite of the consensus sequence from ttie core to ttie NS1 domain of J1 witti ttie nucleotide 
sequence of ttie same domain from HCV1. Also shown are ttie amino acids encoded ttierein. 
Rgure 15 shows a consensus sequence of ttie coding strand of the NSl donnain of J1. as detemiined in Exanple 
IV Also shown are ttie nudeotide sequence of ttie same domain from HCV1 . and ttie amino adds encoded in tfie 
HCV1 and J1 sequences. 

Rgure 16 shows a consensus sequence of a coding strand of ttie C200 region of ttie NS3-NS4 domain of J1. Also 
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shown are the nucleotide sequence of the sanne domain from HCV1 . Also shown are the amino acids encoded in 
the sequences. 

Figure 1 7 shows a consensus sequence of the coding strand of the NSl domain of J1 . as determined in Exanple 
V. Also shown are the nucleotide sequence of the same domain from HCV1, and the amino acids encoded in the 
5 sequences. 

Figure 18 shows a consensus sequence of the coding strand of the untranslated and core domains of J1. Also 
shown are the nucleotide sequence of the same domain from HCV1. and the amino acids encoded in the 
sequences. 

10 Petailed Pescriptjon of the Invention 

[0018] The practice of the present invention will employ; unless othenwse indicated, conventional techniques of 
molecular biology, microbiology, recombinant DMA techniques, and immunology, which are within the skill of the art. 
Such techniques are explained fully in the literature. Sfifl e.g.. Maniatis. Fitsch & Sambrook, MOLECULAR CLONING; 

IS A LABORATORY MANUAL (1982); DNA CLONING, VOLUMES I AND II (D.N Gover ed. 1985); OLIGONUCLEOTIDE 
SYNTHESIS (M.J. Gait ed. 1984); NUCLEIC ACID HYBRIDIZATION (B.D. Names & S.J. Higgins eds. 1984); TRAN- 
SCRIPTION AND TRANSLATION (B.D. Hemes & S.J. Higgins eds. 1984); ANIMAL CELL CULTURE (R.I. Freshney ed. 
1986); IMMOBIUZED CELLS AND ENZYMES (IRL Press. 1986); B. Perbal, A PRACTICAL GUIDE TO MOLECULAR 
CLONING (1984); the series. METHODS IN ENZYMOLOGY (Academic Press, Inc.); GENE TRANSFER VECTORS 

20 FOR MAMMALIAN CELLS (J.H. Miller and M.P. Calos eds. 1987, Cold Spring Harbor Laboratory), Methods in Enzy- 
mology Vol. 154 and Vol. 155 (Wu and Grossman, and Wu, eds.. respectively), Mayer and Walker, ecte. (1987), IMMU- 
NOCHEMICAL METHODS IN CELL AND MOLECULAR BIOLOGY (Academic Press, London), Scopes. (1987), 
PROTEIN PURIFICATION: PRINCIPLES AND PRACTICE. Second Edition (Springer-Verlag. N.Y.). and HANDBOOK 
OF EXPERIMENTAL IMMUNOLOGY, VOLUMES l-IV (D.M. Weir and C. C. Blackwell eds 1986). All patents, patent 

25 applications, and other publications mentioned herein, both supra and infra, are hereby incorporated herein by refer- 
enca 

10019] The term "hepatitis C virus" has been reserved by workers in the field for an heretofore unknown etiologic 
agent of NANBH. Accordingly, as used herein, "hepatitis C virus" (HCV) refers to an agent causative of NANBH. whteh 
was formerly refored to as NANBV and/or BB-NANBV from the dass of the prototype isolate. HCV1. described by 

30 Houghton et al. See. ag., EPO Pub. No. 318.216 and U.S. Patent App. Serial No. 355.002, filed 19 May 1989 (available 
in non-U. S. applications claiming priority therefrom), the disclosures of which are incorporated herein by referenca The 
nucleotide sequence and putative amino acid sequence of HCV1 is shown in Rgure 6. The terms HCV, NANBV, and 
BB-NANBV are used interchangeably herein. As an extension of thfe terminology, the disease caused by HCV, famerly 
called NANB hepatitis (NANBH). is called hepatitis C. The terms NANBH and hepatitis C may be used interchangeably 

35 herein. The term "HCV", as used herein, denotes a viral species of which pathogenic strains cause NANBH, as well as 
attenuated strains or defective interfering particles derived therefrom. 

[0020] HCV is a Flavi-like virus. The morphology and composition of Ravivirus particles are known, and are dis- 
cussed by Brinton (1986) THE VIRUSES: THE TOGAVIRIDAE AND FLAVIVIRIDAE (Series eda Fraenkel-Conrat and 
Wagner, vol eda Schlesinger and Schlesinger, Plenum Press), p.327-374. Generally, with respect to morphology, Ra- 
40 viviruses contain a central nudeocapsid surrounded by a lipid bilayer. Virions are spherical and have a diameter of 
about 40-50 nm. Their cores are about 2S30 nm in diameter. Atong the outer surface of the virion envelope are projec- 
tions that are about 5-10 nm tong with terminal knobs akx>ut 2 nm in diameter. 

[0021] The HCV genome is comprised of RNA. It is known that RNA containing viruses have relatively high rates of 
spontaneous mutation, i,e., reportedly on the order of 1 0"^ to 1 0*^ per incorporated nucleotide. Therefore, there are mul- 

45 tiple strains, which may be virulent or avirulent. within the HCV dass or species. 

[0022] It is believed that the genome of HCV isolates is comprised of a single ORF of approximately 9,000 nudeotides 
to approximately 1 2,000 nudeotides. encoding a polyprotein similar in size to that of HCV1 . an encoded polyprotein of 
similar hydrophobic and antigenic character to that of HCV1 , and the presence of co-linear peptide sequences that are 
conserved with HCV1 . In addition, the genome is believed to be a positive-stranded RNA, 

so [0023] Isolates of HCV connprise epitopes that are immunologically aoss-reactive with epitopes in the HCV1 genome. 
At least some of these are epitopes unique to HCV when compared to other known Raviviruses. The uniqueness of the 
epitope may be detennined by its immunological reactivity with anti-HCV antibodies and lack of immundogical reactivity 
with antibodies to other Ravivirus spedes. Methods for detenmining immunological reactivity are known in the art for 
exanple, by radidmmunoassay. by ELISA assay, by hemagglutination, and several examples of suitable techniques for 

5S assays are provided herein. 

[0024] It is also expected that the overall homdogy of HCV isolates and HCV1 genomes at the nudeotide level prob- 
ably will be about 40% or greater, probably about 60% or greater, and even more probably about 80% to about 90% or 
greater. In adcfition that there are many corresponding contiguous sequences of at least about 13 nudeotides that are 
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fully homologous. The correspondence between the sequence from a new isolate and the HCV1 sequence can be 
determined by techniques known in the art For example, they can be determined by a direct comparison of the 
sequence infbnnation of the polynucleotide from the new isolate and HCV1 sequences. Alternatively, homology can be 
determined by hybridization of the polynucleotides under conditions which fbmi stable duplexes between honx>fogous 
5 regions (Ibr example, those which would be used prior to Si digestion), followed by digestion with single-stranded spe- 
cific nudease(s), followed by size determination of the digested fragments. 

[0025] Because of the evolutionary relationship of the strains or isolates of HCV. putative HCV strains or isolates are 
identifiable by their homology at the polypeptide level. Thus, new HCV isolates are expected to be more than about 40% 
homologous, probably more than about 70% homologous, and even nwre probably more than about 80% homologous. 
10 and possibly even more than about 90% homologous at the polypeptide level. TTie techniques for determining amino 
acid sequence homology are known in the art For example, the amino acid sequence may be determined directly and 
compared to the sequences provided herein. Alternatively the nucleotide sequence of the genomk; material of the puta- 
tive HCV may be determined, the amino add sequence encoded therein can be determined, and the corresponding 
regions compared. 

T5 [0026] TTie ORF of HCV1 is shown in Rgure 1^ The non-structural, cae. and envekspe domains of the polyprotein 
have been predicted for HCV1 (Rgure 5). The "C". or core, polypeptkie is believed to be encoded from the 5' tenninus 
to about nucleotide 345 of HCV1 . The putative "E". or envek)pe. domain of HCV1 is believed to be encoded from about 
nudeotkle 346 to about nudeotide 1050. Putative NS1 , or non-structural one domain, is thought to be encoded from 
about nudeotide 1051 to about nudeotkie 1953. For the remaining domains, putative NS2 is thought to be encoded 

20 from about nudeotkie 1954 to about nudeotkie 3018, putative NS3 from about nudeotide 3019 to about nudeotkie 
4950. putative NS4 from about nudeotkJe 4951 to about nudeotkie 6297, and putative NS5 from about nudeotkie 6298 
to the 3* temiinus respectively. The above boundaries are approximations based on an analysis of the ORF The exact 
boundaries can be determined by those skilled in the art in view of the disdosure herein. 

[0027] •'HCV/jr or "Jl" and "HCV/jr or "Jr refer to new HCV isolates characterized by the nudeotWe sequence 
25 disdosed herein, as well as related isdates that are substantially homologous thereto; i.a, at least about 90% or about 
95% at the nudeotide level. It is believed that the sequences disclosed herein characterize an HCV subclass that is pre- 
dominant in Japan and other Asian and/or Padfk: rim countries. Additk)nal J1 and J7 isolates can be obtained in view 
of the disdosure herein and EPO Pub. No. 318.216. In partkxjlar.theJI and J7 nudeotkie sequences disdosed herein, 
as well as the HCV1 sequences in Rgure 12. can be used as primers or probes to done additional domains of J1 , J7, 
30 or additional isolates. 

[0028] As used herein, a nudeotide sequence "from" a designated sequence or source refers to a nudeotkie 
sequence that is honxjiogous (i.e., Wentical) to or complementary to the designated sequence or source, or a portion 
thereof. The J1 sequences provkied herein are a minimum of 15 nudeotkles preferably 20 nudeotkJes or longer. The 
maximum length is the complete viral genome. 

35 [0029] In some aspects of the invention, the sequence of the region from which the polynudeotWe is derived is pref- 
erably honrxjlogous to or complementary to a sequence which is unk^ue to an HCV genome or the J 1 genome. Whether 
or not a sequence is unkjue to a genome can be determined by techniques known to those of skill in the art For exam- 
ple, the sequence can be compared to sequences in databanks, e.g., Genebank, to determine whether it is present in 
the uninfected host or other organisms. The sequence can also be compared to the known sequences of other viral 

40 agents, induding those which are known to induce hepatitis. e.g.. HAV, HBV, and HDV. and to other members of the Ra- 
viviridae. The correspondence or non-conespondence of the derived sequence to other sequences can also be deter- 
mined by hybrklization under the appropriate stringency conditions. Hybridizatwn technkjues for determining the 
complementarity of nudeic add sequences are known in the art See also, for example, Maniatis et al. (1982) MOLEC- 
ULAR CLONING; A LABORATORY MANUAL (CoW Spring Haibor Press, CoW Spring Harbor, N.Y). In addition, mis- 

45 matches of duplex polynudeotkies fomied by hybridization can be determined by known technk^ues, inducing for 
example, digestion with a nudease such as SI that specifically digests single-stranded areas in duplex polynudeotides. 
Regtons from which typical DNA sequences may be derived include, but are not limited toi regions encoding spedfk: 
epitopes, as well as non-transcrbed and/or non-translated regk)n& 

[0030] The J1 polynudeotkie is not necessarily physically derived from the nudeotkie sequence shown, but may be 
so generated in any manner, induding for example, chemical synthesis or DNA replication or reverse transcription or tran- 
scription. In addition, combinations of regions con-esponding to that of the designated sequence may be modified in 
ways known in the art to be consistent with an intended use. The polynudeotides may also indude one or more labels, 
which are known to those of skill in the art 

[0031 ] An amino acki sequence "from" a designated polypeptide or source of polypeptides means that the amino add 
55 sequence is homologous fi.e.. identicaO to the sequence of the designated polypeptide, or a portion thereof. An amino 
acki sequence "from" a designated nudeic ackl sequence refers to a polypeptide having an amino add sequence klen- 
tical to that of a polypeptide encoded in the sequence, or a portion thereof. The J1 amino add sequences in the 
polypeptkles of the present invention are at least 10 amino adds, more preferably at least about 15 amino adds, and 
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most preferably at least about 20 amino acids. 

[0032] The polypeptides of the present invention are not necessarily translated from a designated nudeic acid 
sequence: the polypeptides may be generated in any manner, including Ibr exanple. chemical synthesis, or expression 
of a recombinant expression system, or isolation from virus. The polypeptides may include one or more analogs of 
5 amino acids or unnatural amino acids. Methods of inserting analogs of amino adds Into a sequence are known in the 
art The pdypeptides may also include one or more labels, which are known to those of skill in the art. 
[0033] The term "recombinant polynudeotide" as used herein intends a polynudeotide of genomic. cDNA. semisyn- 
thetic or synthetic origin which, by virtue of its origin or manipulation: (1) is linked to a polynudeotide other than that to 
whk:h it is linked in nature, or (2) does not occur in nature. 
10 [0034] The term "polynudeotide'' as used herein refers to a polymerk; form of nudeotides of any length, either rftx>- 
nudeotides or deoxyribonudeotides. TWs term refers only to the primary structure of the molecule. Tlius. this term 
indudes double- and single-stranded DNA. and RNA. It also includes known types of nrwdifications, for exanple, labels 
which are known in the art. methylation. "caps", substitution of one or wore of the naturally ocoffring nudeotides with 
an analog, internudeotide modifications such as. for example, those with uncharged linkages (ag.. methyl phospho- 
rs nates, phosphotriesters. phosphoamidates, carbamates, etc.) and with charged linkages (e.g.. phosphorothioates, 
phosphorodithioates. eta), those containing pendant moieties, such as. for example proteins (induding for e.g., nude- 
ases. toxins, antibodies, signal peptides. poly-L-lysine. etc.). those with intercalators {e.g.. acrkiine, psaalen, etc.). 
those containing chelators (e.g.. metals. radk>actrve metals, boron, oxidative metals, etc.). those containing alkylatoiB. 
those with nrxxlified linkages (e.g., a^ha anonrteric nudeic adds, eta), as well as unmodified fbnns of the pdynude- 
20 otide. 

[0035] "Purified polynudeotxje" refers to a corrposition comprising a specified polynudeotkle that is substantially free 
of other components, such composition typically comprising at least about 70% of the specified polynudeotkle. more 
typically at least about 80%. 90% or even 95% to 99% of the specified polynudeotkle. 

[0036] "Purified polypeptWe" refers to a conposition comprising a specified polypeptkje that is substantially free of 
25 other components, such composition typically comprising at least about 70% of the specified polypeptkJe. more typi- 
cally at least about 80%. 90% or even 95% to 99% of the specified polypeptide. 

[0037] "Recombinant host ceHs", "host cells", "cells", "cell lines", "cell cultures", and other such terms denote micro- 
organisms or higher eukaryotic cell lines cultured as unicellular entities that can be. or have been, used as recipients 
for a recombinant vector or other transfer DMA. and indude the progeny of the original cell which has been transformed. 

30 It is understood that the progeny of a single parental cell may not necessarily be conrpletely identteal in morphology or 
in genomic or total DNA complement as the original parent due to natural, accidental, or deliberate mutation. 
[0038] A "replicon" is any genetic element e.g.. a plasmkj, a chromosome, a virus, a cosmW. eta that behaves as an 
autonomous unit of polynudeotide replication witiiin a cell; i.a, capable of replteation under its own control. 
[0039] A "doning vector" is a replicon ttiat can transform a selected host cell and in which another polynudeotide seg- 

35 ment is attached, so as to bring about the replication and/a expression of the attached segment. Typically, doning vec- 
tors indude plasnrtids. virus (e.g.. bacteriophage vector) and oosmid& 

[0040] An "integrating vector" is a vector ttiat does not behave as a replicon in a selected host cell, but has the ability 
to integrate into a replicon (typfcally a chromosome) resklent in ttie selected host to stably transform the host. 
[0041 ] An "expression vector" is a construct ttiat can transform a selected host cell and provkJes for expression of a 
40 heterologous coding sequence in the selected host Expression vectors can be eittier a doning vector or an integrating 
vector. 

[0042] A "coding sequence" is a polynucleotide sequence whfch is transaibed into mRNA and/or translated into a 
polypeptide when placed under ttie control of appropriate regulatory sequences. The boundaries of the coding 
sequence are detennined by a translation start codon at the 5 -terminus and a translation stop codon at the 3'-terminus. 

45 A coding sequence can indude. but is not limited to mRNA. cDNA. and recombinant polynudeotide sequences. 

[0043] "Control sequence" refers to polynucleotide regulatory sequences which are necessary to effect the expres- 
sion of coding sequences to which ttiey are ligated. The nature of such control sequences differs depending upon ttte 
host organism. In prokaryotes, contrd sequences generally indude pronxrter. rtoosomal binding site, and tenminators. 
In eukaryotes generally control sequences indude promoters, terminators and, in some instances, enhancers. TTie 

so term "control sequences" is intended to indude, at a minimum, all components the presence of which are necessary for 
expression, and may also indude additional advantageous components. 

[0044] "OperaWy linked" refers to a juxtaposition wherein the components so described are in a relationship permit- 
ting them to function in ttieir intended manner. A control sequence "operaWy linked" to a coding sequence is ligated in 
such a way that expression of the cocfing sequence is achieved under conditions compatible with ttie control 
55 sequences. 

[0045] An "open reading frame" or ORF is a regwn of a polynudeotide sequence which encodes a polypeptide; ttiis 
region may represent a portion of a coding sequence or a total coding sequence. 

[0046] "Immunologically cross-reactive" refers to two or wore epitopes or polypeptkies that are bound by ttie same 
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antibody. Cross-reactivity can be determined by any of a nunt)er of immunoassay techniques, such as a competition 
assay. 

[0047] As used herein, the term "antibody" refers to a polypeptide or group of polypeptides which comprise at least 
one epitope. An "antigen birxiing site" is fonned from the folding of the variable domains of an antibody molecule(s) to 

B form tttree-dmensional binding sites wrtti an internal surface shape and charge distribution complementary to tfie fea- 
tures of an epitope of an antigen, which allows specific binding to form an antibody^ntigen complex. An antigen binding 
site may be fbmned from a heavy- and/or light-chain domain (VH and VL respectively), which form hypervariaUe loops 
which contribute to antigen binding. The term "antibody" includes, without limitation, chimeric antftxxlies, altered anti- 
bodies,univalent antibodies. Fab proteins, and single-domain antftxxlies. In nrwiy cases, the biding phenomena of anti- 

10 bodies to antigens is equivalent to other ligand/anti-ligand binding. 

[0048] As used herein, a "single domain antibody" (dAb) Is an antibody which is comprised of an HL domain, which 
binds specifically witti a designated antigen. A dAb does not contain a VL domain, but may contain other antigen bind- 
ing domains known to exist to antflsodies, for example, the kappa and lambda domains. Methods for preparing dAbs are 
known in the art. See, for example. Ward et al. Nature 241: 544 (1 989). 

15 [0049] Antibodies may also be comprised of VH and VL domains, as well as other known antigen binding domains. 
Examples of these types of antibodies and methods for tiieir preparation and known in ttie art (see, ag.. U.S. Patent 
No. 4.816.467. whk*i is incorporated herein by reference), and include ttie following. For exan^le, "vertebrate antftxxJ- 
ies" refers to antibodies which are tetramers or aggregates thereof, comprising light and heavy chains which are usually 
aggregated in a "Y" configuration and which may or may not have covalent linkages between the chains. In vertebrate 

20 antibodies, the amino acki sequences of ttie chains are homologous witti those sequences found in antibodies pro- 
duced in vertebrates, whether in situ or in vitro (for example, in hybridomas). Vertebrate antibodies include, for exanple, 
purified polyctonal antibodes and monoclonal antibodies, methods for the preparation of which are desaibed infra 
[0050] "Hybrid antibodies" are antibodies where chains are separately homologous witti reference to mammalian anti- 
body chains and represent novel assemblies of ttiem, so that two different antigens are predpitable by the tetramer or 

25 aggregata In hybrid antibodies, one pair of heavy and light chains are homotogous to those found in an anttoody raised 
against a first antigen, while a second pair of chains are homotogous to those found in an antibody raised against a sec- 
ond antibody This results in the property of "divalence". i.a, ttie ability to bind two antigens simultaneously. Such 
hybrids may also be formed using chimeric chains, as set fbrtii betow. 

[0051] "Chimeric anttoodies" refers to antibodies in which ttie heavy and/or light chains are fusion proteins. Typically, 
30 one portion of ttie amino acid sequences of ttie chain is homologous to conresponding sequences in an antibody 
derived from a particular species or a particular dass, while ttie remaining segment of ttie chain is homologous to ttie 
sequences derived from anottier species and/or class. Usually, ttie variable region of both light and heavy chains mim- 
ics the variable regions or antibodies derived from one species of vertebrates, while ttie constant portions are homolo- 
gous to ttie sequences in ttie antibodies derived from anottier species of vertebrates. However, ttie definition is not 
35 limited to ttiis particular exampla Also included is any anttoody in which eittier a botti of ttie heavy or light chains are 
composed of combinations of sequences mimicking ttie sequences in antibodies of different sources, whettier ttiese 
sources be from differing classes or different species of origin, and whetfier or not ttie fuston point is at ttie variable/con- 
stant boundary. Thus, it is possible to produce antibodies in which neittier ttie constant nor ttie variable regton mimic 
know antibody sequences. It ttien becomes possible, for example, to construct antibodies whose variable region has a 
40 higher specific affinity for a particular antigen, or whose constant region can elicit enhanced complement fixation, or to 
make other inprovements in properties possessed by a particular constant region. 

[0052] Anottier example is "altered antibodies", which refers to anttoodies in which ttie naturally occurring amino acki 
sequence in a vertebrate antibody has been varies. Utilizing recombinant DNA techniques, antibodies can be rede- 
signed to obtain desired characteristics. The posstole variations are many, and range from the changing of one or more 

45 amino adds to ttie complete redesign of a region, for example, ttie constant region. Changes in ttie constant regton, in 
general, to attain desired cellular process characteristics, ag.. changes in complement fixation, interaction witti mem- 
branes, and ottier effector functions. Changes in ttie variable region may be made to alter antigen binding characteris- 
tica The anttoody may also be engineered to aid ttie specific delivery of a molecule or substance to a spedfte cell or 
tissue sita The desired alterations may be made by known technk^ues in molecular btology. ag., recombinant tech- 

50 niques. site-directed mutagenesis, etc. 

[0053] Yet anottier example are "univalent anttoodies". which are aggregates comprised of a heavy-chain/light-chain 
dimer bound to ttie Fc O e., stem) regton of a second heavy chain. This type of antibody escapes antigenic modulation. 
See, ag.. Glennie et al. Nature 295: 712 (1982). Induded also wittiin ttie definition of anttoodies are "F«b" fragments of 
antibodies. The "Fab" region refers to ttiose portions of ttie heavy and light chains which are roughly equivalent, or anal- 

55 ogous. to ttie sequences which comprise ttie branch portion of ttie heavy and light chains, and which have been shown 
to exhibit immunotogical bincfing to a specified antigen, but which lack the effector Fc portion. "Fab" includes aggregates 
of one heavy and one light chain (commonly known as Fab*), as well as tetramers containing ttie 2H and 2L chains 
(referred to as F(ab)2)» which are capable of selectively reacting wrtti a designated antigen or antigen family Rib anti- 
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bodies may be divided into subsets analogous to those described above, i.e.. Vertebrate Fab". Iiybrid Fab', "chimeric 
Fab", and "altered Fab". Methods of producing Fab fragments of antftxxiies are known within the art and include, for 
example, proteolysis, and synthesis t>y recombinant techniques. 

[0054] "Epitope" refers to an antbody binding site usually d^ined by a polypeptide, but also by non-amino add hap- 
5 tens. An epitope could comprise 3 amino adds in a spatial conformation which is unique to the epitope, generally an 
epitope consists of at least 5 such amino adds, and more usually, consists of at least 8-10 such amino adds. 
[0055] "Antigen-anttoody complex" refers to the complex formed by an antibody that is specif Icaily bound to an epitope 
on an antigen. 

[0056] "Immunogenic polypeptide" refers to a pdypeptide that elicits a cellular and/or hunrwral innmune response in a 
10 oiammal. whether alone or linked to a carrier, in the presence or absence of an adjuvant 

[0057] "Polypeptide" refers to a polymer of amino adds and does not refer to a specific length of the molecula Thus, 
peptides, oligopeptides, and proteins are induded within the definitk}n of polypeptide. This term also does not refer to 
or exclude post-expression nrwdif ications of the polypeptide, for example, glycosylations. acetylations, phosphoryla- 
tions and the like. Induded within the definitton are. for example, polypeptides containing one or more analogs of an 
IS amino acid Gnduding, for example, unnatural amino adds, etc.), polypeptides with substituted linkages, as well as other 
nvxlif ications known in the art. both naturally occurring and non-naturally occurring. 

[0058] "Transformation', as used herein, refers to the insertion of an exogenous polynudeotide into a host cell, irre- 
spective of the nrethod used for the insertion, for exanple. direct uptake, transduction, f-mating or electroporatton. The 
exogenous polynudeotide may be maintained as a non-integrated vector, for example, a plasmid, or alternatively, may 
20 be integrated into the host genoma 

[0059] A Transformed" host cell refers to both the immediate cell that has undergone transformation and its progeny 

that maintain the originally exogenous polynudeotida 

[0060] "Treatment" as used herein r^ers to prophylaxis and/or therapy. 

[0061 ] "individual", refers to vertebrates, particulariy merTt}ers of the mammalian spedes, and indudes but is not lim- 

25 ited to domestic animals, sports animals, and primates, induding humana 

[0062] "Sense strand" refers to the strand of a double-stranded DNA molecule that is homologous to a mRNA tran- 
script thereof. The "anti-sense strand" contains a sequence which is complementary to that of the "sense strarxj". 
[0063] "Antibody<ontaining body componenT refers to a component of an individual's body which is a source of the 
antibodies of interest Antibody-containing body components are known in the art, and indude but are not limited to, 

30 whole Wood and corrponents thereof, plasma, semm. spinal fluid, lynrph fluid, the external sections of the respiratory, 
intestinal, and genitourinary tracts, tears, saliva, ntilk, white dood cells, and myelomaa 

[0064] "Purified HCV" isolate refers to a preparation of HCV partides which has been isdated from the cellular con- 
stituents with which the virus is normally associated, and from other types of viruses whfeh may be present in the 
infeded tissua The tediniques for isolating viruses are known to those of skill in the art and indude, for example, cen- 
35 trifiigation and affinity chromatography. 

[0065] An HCV "partide" is an entire virion, as well as partides which are intermediates in virion formation. HCV par- 
tides generally have one or more HCV proteins assodated with the HCV nudeic acid. 

[0066] "Probe" refers to a polynudeotide which fonms a hybrid structure with a sequence in a target polynudeotide. 
due to a complementarity of at least one region in the probe with a region in the target 

40 [0067] "Biological sample" refers to a sample of tissue or f luki isolated from an individual, induding but not limited to, 
for example, whde Wood and components thereof, plasma, serum, spinal fluid, the external sections of the sWn, respi- 
ratory, intestinal, and genitourinary tracts, tears saliva, milk, Wood cells, tumors, organs, and also samples of invrtro cell 
culture constituents (inducfing but not limited to condBtioned medium resulting from the growrth of cells in cell culture 
medium, putatively virally infected cells, recombinant cells, and cell components). 

45 [0068] The invention pertains to the isolation and characterization of a newly discovered isolate of HCV, J t. its nude- 
otide sequence, its protein sequence and resulting polynudeotides and polypeptides and antibodies derived therefrom. 
However, ttie isdation and characterization of another newly discovered isolate of HCV, J7, is also described herein, 
although it does not tonri part of the invention daimed. Isolates J1 and J7 are novel in ttieir nucleotide and amino add 
sequences, and are believed to be charaderistic of HCV isolates from Japan and other Asian countries. 

50 [0069] The nudeotide sequences derived from HCV J1 are useful as probes to diagnose the presence of virus in sam- 
ples, and to isolate other naturally occurring variants of the virua These nudeotide sequences also make availaWe 
polypeptide sequences of HCV antigens encoded within the J1 genome and permits the production of polypeptides 
whfeh are useful as standards or reagents in diagnostic tests and/or as components or vacdnes. Antibodies, both pol- 
yclonal and monodonal, directed against HCV epitodes contained within these polypeptide sequences are also useful 

55 for diagnostic tests, as ttierapeutic agents, for saeening of antiviral agents, and for ttie isolation of the NANBH virua In 
addition, by utilizing probes derived from the sequences cfisdosed herein it is possiWe to isolate and sequence other 
portions of the J1 genome, thus giving rise to additional probes and pdypeptides which are useful in the diagnosis 
and/or treatment both prophylactic and tiierapeutic. of NANBH. 
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[0070] The availability of the HCV/J1 nucleotide sequences enable the construction of polynucleotide probes and 
polypeptides useful in diagnosing NANBH due to HCV infection and in screening blood donors as well as donated blood 
and blood products for infection. For exanple. from the sequences it is possft)le to synthesize DNA oligomers of about 
8-10 nucleotides, or larger, which are useful as hybridization probes to detect the presence of HCV RNA in. for exanple. 

5 sera of subjects suspected of harboring the vims, or for screening donated Uood for the presence of the vims. The 
HCV/J1 sequences also allow the design and production of HCV specific polypeptides which are useful as diagnostic 
reagents for the presence of antibodies raised during NANBH. Antibodies to purified polypeptides derived from the 
HCV/J1 sequences may also be used to detect viral antigens in infected individuals and in blood. 
[0071 ] Kncwiedge of these HCV/J1 sequences also enable she design and production of polypeptides wNch may be 

10 used as vaccines against HCV and also for the production of antibodies, which in turn may be used for protection 
against the disease, and^or Ibr therapy of HCV infected individuals. Moreover, the disclosed HCV/J 1 sequences, enable 
further characterization of the HCV genoma Polynucleotide probes derived from these sequences, as well as from the 
HCV genome, may be used to screen cONA libraries for additional viral cDNA sequences, which, in turn, may be used 
to obtain additional overiapping sequences. See. e.g.. EPO Pubi Na 318.216. 

15 [0072] The HCV/J1 polynucleotide sequences, the polypeptides derived therefrom and the antftxxiies directed 
against these polypeptides, are useful in the isolation and identification of the BB-NANBV agent(8). For example, anti- 
bodies directed against HCV epitopes contained in polypeptides derived from the HCV/J1 sequences may be used in 
processes based upon affinity chromatography to isolate the vims. Alternatively, the antibodies may be used to identify 
viral particles isolated by other techniques. The viral antigens and the genomic material within the isolated viral particles 

20 may then be further characterized. 

[0073] The infomialion obtained from further sequencing of the HCV/J1 genome, as well as from further characteri- 
zation of the HCV/J1 antigens and characterization of the genomes enable the design and synthesis of additional 
probes and polypeptides and antftxxiies which may be used for diagnosis, for prevention, and for therapy of HCV 
induced NANBH. and for saeening for infected blood and blood-related products. 

25 [0074] The availability of HCV/J 1 cDN A sequences permits the construction of expression vectors encoding antigen- 
ically active regions of the polypeptide encoded in either strand. These antigenically active regions may be derived from 
coat or envelope antigens or from core antigens, or from antigens which are non-structural including, for example, poly- 
nucleotide binding proteins, polynucleotide polymerase(s). and other viral proteins required for the replication and/or 
assembly of the virus partide. Fragments encoding the desired polypeptides are derived from the cDNA clones using 

30 conventional restriction digestion or by synthetic methods, and are ligated Into vectors which may, for exanple. contain 
portions of fusion sequences such as beta-galactosidase or superoxide dismutase (SOD). Methods and vectors which 
are useful for the production of polypeptides which contain fusion sequences of SOD are described in EPO Pub. No. 
1 96,056. Vectors encoding fusion polypeptides of SOD and HCV polypeptides are described in EPO Pub. No. 318.216. 
Any desired portion of the HCV cDNA containing an open reading frame, in either sense strand, can be obtained as a 

35 recombinant polypeptide, such as a mature or fusion protein. Alternatively, a polypeptide encoded in the cDNA can be 
provided by chemical synthesis. 

[0075] The DNA encoding the desired polypeptide, whether in fused or mature form, and whether or not containing a 
signal sequence to permit seaetion, may be ligated into expression vectors suitable for any convenient host Both 
eukaryotic and prokaryotic host systems are presently used in fonning recombinant polypeptides, and a sunvnary of 

40 some of the more common control systems and host cell is given below. The polypeptide produced in such host cells 
is then isolated from lysed cells or from the culture medium and purified to the extent needed tor its intended use. Puri- 
fication may be by techniques known in the art. for example, differential extraction, salt fractionation, chromatography 
on ion exchange resins, affinity chromatography, centrifugation, and the like. See, for exanple. Methods in Enzvmology 
for a variety of methods tor purifying proteins. 

45 [0076] Such recombinant or synthetic HCV polypeptides can be used as diagnostics, or those which give rise to neu- 
tralizing antibodies may be formulated into vaccines. Antibodies raised against these polypeptides can also be used as 
diagnostics, or for passive immunotherapy. In addition, anttoodies to these polypeptides are useful for isolating and 
identifying HCV particles. 

[0077] The HCV antigens may also be isolated from HCV virions. The virions may be grown in HCV infected cells in 

so tissue culture, or in an infected host 

[007B] While the polypeptkJes of the present invention may comprise a substantially complete viral domain, in many 
applications all that Is required Is that the polypeptide comprise an antigenic or immunogenic region of the virus. An 
antigenic region of a polypeptide is generally relatively small-typically 8 to 10 amino acids or less in length. Fragments 
of as few as 5 amino acids may characterize an antigenic region. These segments may correspond to regions of 

55 HCV/J1 epitopes. Accordingly, using the cDNAs of HCV/J1 as a basis. DNAs encoding short segments of HCV/J1 
polypeptides can be expressed recombinantly either as fusion proteins, or as isolated polypeptides. In addition, short 
amino add sequences can be conveniently obtained by chemical synthesis. 

[0079] In instances wherein the synthesized polypeptide is correctly configured so as to provide the con-ect epitope. 
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but is too small to be immunogenic, the polypeptide may be linked to a suitable canier. A number of techniques for 
obtaining such linkage are known in the art. including the fbrnvition of disulfide linkages using N-succinimidyl-3-(2-pyri- 
dyl-thio)propionale (SPDP) and suocinimidyl 4-(N-maIeimido-methyOcyck)hexane-1-carbQxyiate (SMCC) obtained 
from Pierce Company. Rockfbrd. Illinois, Cif the peptide lacks a sulfh^ndryl group, this can be provided by addition of a 
cysteine reskiue.) These reagents create a disulfide linkage between themselves and peptide cysteine residues on one 
protein and an anrude linkage through the epsilon-amino on a lysine, or other free amino group in the other. A variety of 
such disulf ide/amid&-forming agents are known. See. for example, tmmun. Rev. (1982) S:185. Other bifunctional cou- 
pling agents form a thioether rather than a disulfide linkage. Many of these thio-ether-forming agents are commercially 
available and include reactive esters of 6-maleimidocaproic ackJ. 2-bromoacetk; acid, 2Hodoac6tic add, 4-(N-maleim- 
ido-methyl)cyclohexan6-1-cartx3Kylic acid, and the like. The cartxixyl groups can be activated by combining them with 
sucdnimkie or 1-hydrQxyl-2-nitro-4-sulfonic acid, sodium salt. Additional methods of coupling antigens employs the 
rotavirusTbinding peptide" system described in EPO Pub. Na 259.149. the disdosure of which is incorporated herein 
by referenca The foregoing list is not meant to be exhaustive, and modifications of the named compounds can dearly 
be used. 

[0080] Any carrier may be used whk^h does not itself induce the production of antibodies hannful to the host Suitable 
carriers are typically large, slowly metabolized maaomolecules such as proteins; polysaccharides, such as latex func- 
tionalized sepharose. agarose, cellulose, cellulose beads and the like; polymeric amino adds, such as polygtutamic 
acid, polylysine. and the like; amino add copolymers; and inactive vims partides. Espedally useful protein substrates 
are serum albumins, keyhole limpet hemocyanin. immunoglobulin molecules, thyroglobulin. ovalbumin, tetanus toxoki. 
and other proteins well known to those skilled in the art. 

[0081 ] In addition to full-length viral proteins, polypeptides comprising tnjncated HCV amino add sequences encod- 
ing at least one viral epitope are useful immundogical reagents. For example, polypeptides comprising such truncated 
sequences can be used as reagents in an immunoassay. These polypeptkles also are candidate subunit antigens in 
conpositions for antiserum production or vaodnes. While these truncated sequences can be produced by various 
known treatments of native viral protein, it is generally prefen-ed to make synthetic or recombinant polypeptides conrv 
prising an HCV sequenca Pdypeptides coarprising these truncated HCV sequences can be made up entirely of HCV 
sequences (one or more epitopes, either contiguous or noncontiguous), or HCV sequences and heterologous 
sequences in a fusion protein. Useful h^erologous sequences include sequences that provkle for secretion from a 
recombinant host, enhance the immunological reactivity of the HCV epitope(s). or fadlitate the coupling of the polypep- 
tide to an immunoassay support a a vacdne canier. See. ag.. EPO Pub. Na 1 1 6.201 ; US. Pat. Na 4,722,840; EPO 
Pub. No. 259,149: U.S. Pat. No. 4.629,783, the disclosures of which are incorporated herein by referenca 
[0082] The size of polypeptkJes conprising the tmncated HCV sequences can vary widely, the minimum size being 
a sequence of sufftoient size to provide an HCV epitope, while the maximum size is not critical. In some applications, 
the maximum size usually is not substantially greater than that required to provide the desired HCV epitopes and func- 
tion(s) of the heterologous sequence, if any. Typically, the truncated HCV amino acid sequence will range from about 5 
to about 100 amino adds in length. More typk»Ily, however, the HCV sequence will be a maximum off about 50 amino 
ackis in length, preferably a maximum of about 30 amino adds. It is usually desirable to select HCV sequences of at 
least about 10, 12 or 15 amino adds, up to a maximum of about 20 or 25 amino ackJs. 

[0083] Truncated HCV amino add sequences comprising epitopes can be Certified in a mrrber of ways. For exam- 
ple, the entire viral protein sequence can be screened by preparing a series of short peptkJes that together span the 
entire protein sequenca By starting with, for example. 100-mer polypeptides, it would be routine to test each polypep- 
tkle for the presence of epftope(s) showing a desired reactivity, and then testing progressively smaller and overlapping 
fragments from an Klentified 100-mer to map the epitope of interest. Screening such peptides in an immunoassay is 
within the skill of the art. It is also known to can^y out a computer analysis of a protein sequence to Wentify potential 
epitopes, and then prepare oligopeptides comprising the identified regions for screening. It is appreciated by those of 
skill in the art that such computer analysis of antigenicity does not always identify an epitope that actually exists, and 
can also incorrectly identify a region of the protein as containing an epitope. 

[0084] The observed relationship of the putative polyproteins of HCV and the Raviviruses allows a prediction of the 
putative domains of the HCV "non-structural" (NS) proteins. The locations of the indivkiual NS proteins in the putative 
Flavivims precursor polyprotein are fairly well-known. Moreover, these also coinckle with obsen^ed gross fluctuations in 
the hytfrophobidty profBe of the polyprotein. It is established that NS5 of Raviviruses encodes the virion pdymerase, 
and that NSI con-esponds with a complement fixation antigen which has been shown to be an effective vaccine in ani- 
mals. Recentiy, it has been shown that a f laviviral protease function resktes in NS3. Due to the obsen^ed similarities 
between HCV and the Raviviruses, deductions concerning the approximate locations of the corresponding protein 
domains and functions In the HCV pdyprotein are possible. Rgure 1 1 is a schematic of putative domains of the HCV 
polyprotein. The expression of polypeptides containing these domains in a variety of recombinant host cells, including, 
for example, bacteria, yeast, insect, and vertebrate cells. shouW give rise to important immunological reagents whk:h 
can be used for diagnosis, detection, and vacdnes. 
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[0085] Although the non-structural protein region of the putative polyproteins of the HCV isolate described herein and 
of Roviviruses appears to be generally similar, there is less similarity between the putative structural regions which are 
towards the N-terminus. In this region, there is a greater divergence in sequence, and in addtion. the hydrophobic pro- 
file of the two regions show less similarity. This "divergence" begins in the N-terminal region of the putative NS1 domain 
5 in HCV. and extends to the presumed N-terminu& Nevertheless, it is still possible to predict the approximate locations 
of the putative nudeocapsid (N-terminal basic domain) and E (generally hydrophobic) domains within the HCV polypro- 
tein. Rom these predctions it may be possible to Identify approximate regions of the HCV polyprotein that oouM corre- 
spond with useful imrrunologtcal reagents. For example, the E and NS1 proteins of Baviviruses are known to have 
efficacy as protective vaccines. These regions, as well as some which are shown to be antigenic in the HCV1 . for exam- 
10 pie those within putative NS3. C. and NS5, etc., should also provide diagnostic reagents. 

[0086] The immunogenicrty of the HCV sequences may also be enhanced by preparing the sequences fused to or 
assembled with partide-forming proteins such as. for exarrple. hepatitis B surface antigen or rotavirus VP6 antigen. 
(Constructs wherein the HCV epitope is linked directiy to tiie partide-forming protein coding sequences produce hybrids 
which are immunogenic with respect to the HCV epitope. In addition, all of ttie vectors prepared indude epitopes spo- 
ts dfic to HBV. having various degrees of irrvnunogenicity. such as. for exanple. the pre-S peptida Thus, partides con- 
structed from partide forming protein which indude HCV sequences are imnunogenic with respect to HCV and 
partide-form protein. See. e.g.. U.S. Pat. Na 4,722,840; EPO Pub Na 1 75,261 ; EPO Pub. Na 259, 1 49; Michelle et al. 
(1984) Int. Symposium on Viral Hepatitia 

[0087] Vaodnes may be prepared from one or nrxx-e immunogenk; polypeptides derived from HCVAil . The observed 
20 homok>gy between HCV and Raviviruses provkjes information conceming tiie polyp^des which are likely to be nrx)st 
effective as vacdnes. as well as the regions of the genome in which they are encoded. The general structure of the Ra- 
vivirus genome is discussed in Rice et al. (1986) in THE VIRUSES: THE TDQAVIRIDAE AND FLAVIVIRIDAE (Series 
eds. Fraenkel-Conrat and Wagner, Vol eds. Schlesinger and Schlesinger. Plenum Press). TTiefiavivirus genomic RNA 
is believed to be the only vims-specific mRNA spedes, and it is translated into the three viral structural proteins, i.e.. C, 
25 M, and E. as wen as two large nonstructural proteins. r^4 and NV5. and a complex set of smaller nonstructural pro- 
teins. It is known that major neutralizing epitopes for Raviviruses reskJe in the E (envefope) protein. Roehrig (1986) in 
THE VIRUSES: THE TOGAVIRIDAE AND FLAVIVIRIDAE (Series eds. Fraenkel-Conrat and Wagner, Vbl eds. Schles- 
inger and Schlesinger. Plenum Press). The corresponding HCV E gene and polypeptide encoding region way be pre- 
dicted, based ipon the homology to Raviviruses. Thus, vaccines may be comprised of recombinant polypeptides 
30 containing epitopes of HCV E. These polypeptides may be expressed in bacteria, yeast or mammalian cells, or alter- 
natively may be isolated from viral preparations. It is also anticipated ttiat the other structural proteins may also contain 
epitopes which give rise to protective anti-HCV anttoodies. Thus, polypeptxles containing tiie epitopes of E. C, and M 
may also be used, whether singly or in combination, in HCV vacdnes. 

[0088] In addition to the above, it has been shown that immunization with NS1 (nonstructural protein 1 ), results in pro- 
35 tection against yelfow fever. Schlesinger et al (1986) J. Virol. gQ:1 153. This is true even though the immunization does 
not give rise to neutralizing antibodies. Thus, particularly since this protein appears to be highly conserved among Ra- 
viviruses. it is likely that HCV NSl will also be protective against HCV infection. Moreover, it also shows that nonstruc- 
tural proteins may provkle protection against viral pathogenicity, even if they do not cause the production of neutralizing 
antibodies. 

40 [0089] In view of the above, multivalent vacdnes against HCV may be comprised of one or more epitopes from one 
or more structural proteins, arxl/or one or more epitopes from one or more nonstructural proteins. These vacdnes may 
be comprised of. for example, recombinanl HCV polypeptides and/or polypeptkles isolated from the virions. In particu- 
lar, vacdnes are contemplated comprising one or more of the following HCV proteins, or subunit antigens derived there- 
from: E, NS1, C. NS2, NS3, NS4 and NS5. Particulariy preferred are vacdnes comprising E and/or NSl. or subunits 

45 thereof. In addition, it may be possble to use inactivated HCV in vacdnes; inactivation may be by the preparation of viral 
lysates. or by otiier means known in the art to cause inactivatfon of Raviviruses. for example, treatment with organic 
solvents or detergents, or treatment with formalin. Moreover, vaccines may also be prepared from attenuated HCV 
strains or from hybrid viruses such as vacdnia vectors known in the art [Brown et al. Nature US' 549-550 (1986)]. 
[0090] The preparation of vaccines whk:h contain immunogenk; polypeptide(s) as active ingredients is known to one 

so skilled in the art. Typically, such vacdnes are prepared as irijectables. either as liquxi solutions or suspensfons; solkJ 
forms suitable for solution in, or suspension in. liquki prior to injection may also be prepared. The preparation may also 
be emulsified, or tiie protein encapsulated in liposomes. The active immunogenic ingredients are often mixed witti 
exdpients which are pharmaceutically acceptable and compatble witii the active ingredient. Suitable exdpients are, for 
example, water, saline, dextrose, glycerol, ethanol, or the like and combinations thereof. In addition, if desired, the vac- 

55 cine may contain minor amounts of auxiliary substances such as wetting or emulsifying agents, pH buffering agents, 
and/or adjuvants which enhance the effectiveness of the vacdne. Exarrples of adjuvants which may be effective include 
but are not limited to: aluminum hydroxkie, N-acetyl-muramyl-L-threonyl-D-isoglutamine (thr-MDP), N-acetyl-nor- 
muramyl-L-alanyl-D-isoglutamine (CQP 11637. referred to as nor-MDP), N-acetylmuramyl-L-alanyl-D-isoglulaminyl-L- 
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alanine-a-CI'-a'^lipalmitoyt-sn-glycerDl-S-^ (CGP 19835A. referred to as MTP-PE), 

and RIBI. which contains three components extracted from t>acteria, monophosphoryl Iqpid A, trehalose dimycolate and 
cell wall skeleton (MPUTDM^^^WS) in a 2% squalene/Tween 80 emulsion. The effectiveness of an acQuvant may be 
determined by measuring the amount of antibodies directed against an immunogenic polypeptide containing an HCV 
5 antigenic sequence resufting from administration of this polypeptide in vaccines which are also conrprised of the various 
adjuvants. 

[0091 ] The vaccines are conventionally administered parenterally. by injection, usually, either subcutaneousty or intra- 
musculariy Additional formulations which are suitable for other modes of administration include suppositories and. In 
some cases, oral formulations. For suppositories, traditional binders and carriers may Include, for example, polyalkylene 

10 glycols or triglycerides; such suppositories may be formed from mixtures containing the active ingredient in ttie range 
of 0.5% to 10%. preferably 1%-2%. Oral fomnulations include such nomnally enployed exdpients as. for exanple, phar- 
maceutical grades of mannitol. lactose, starch, magnesium stearate. sodium saccharine, cellulose, magnesium cartwn- 
ate, and the lika These compositions take the form of solutions. suspenskMis. tablets, pills, capsules, sustained release 
formulations or powders and contain 10%-95% of active ingredient preferably 25%-70%. 

IS [0092] The proteins may be fonnulated into the vaccine as neutial or salt forms. Pharmaceutically acceptable salts 
include the add addition salts (fonned with free amino groups of the peptide) and wWch are fbnned with inorganic adds 
such as. for example, hydrochforic or phosphoric acids, or such organic acids such as acetic, oxalic, tartaric, maleic. 
and the lika Salts Ibmied with the free carboxyl groups may also be derived from inorganic bases such as. for example, 
sodium, potassium, ammonium, caldum, or ferric hydroxkies, and such organfo bases as isopropylamine, trimethyl- 

20 amine, 2-ettTy1amino ethanol. histidine, procaine, and the like. 

[0093] The vacdnes are administered in a nranner oompatfole with the dosage formulation, and in such amount as 
will be prophylactically and/or therapeutically effective. The quantity to be administered, which is generally in the range 
of 5 mkaDgrams to 250 micrograms of antigen per dose, depends on the subject to be treated, capacity of the subject*s 
immune system to synthesize antitxxjies, and the degree of protection desired. Predse amounts of active ingredient 

25 required to be administered may depend on the judgment of the practitioner and may be peculiar to each subject 
[0094] The vaodne may be given in a single dose schedule, or preferably in a multiple dose schedule A multiple dose 
schedule is one in whk:h a primary course of vaodnation may be witti 1-10 separate doses, fblfowed by ottier doses 
given at subsequent time intervals required to maintain and or reenfbrce the immune response, for exanple, at 1-4 
monttis for a second dose, and if needed, a subsequent dose(s) after several months. The dosage regimen will also, at 

30 least in part, be determined by the need of ttie individual and be dependent upon ttie judgment of the practitioner. 
[0095] In addition, ttie vaccine containing ttie immunogenk: HCV antigen(s) may be administered in conjunction witti 
ottier immunoregulatory agents, for example, immune globulins. 

[0096] The immunogenic polypeptides prepared as described above are used to produce antibodies, both polydonal 
and monoclonal. If polyclonal antibodies are desired, a selected mammal (e.g., mouse, rabbit, goat, horse, etc.) is 

35 immunized witti an immunogenfe polypeptide bearing an HCV epitope(s). Serum from ttie immunized animal is col- 
lected and freated aocorcfing to known procedures. If serum containing pdydonal antibodies to an HCV epitope con- 
tains antibodies to ottier antigens, the polydonal antibodies can be purified l3y immunoaffinity chromatography. 
Techniques for producing and processing polydonal antisera are known in ttie art. see for example. Mayer and Walker, 
eds. (1987) IMMUNOCHEMICAL METHODS IN CELL AND MOLECULAR BIOLOGY (Academic Press. Lx)ndon). 

40 [0097] Monoclonal antilxxlies directed against HCV epitopes can also be readily produced by one skilled in the art 
The general mettioddogy for making monodonal antfoodies by hybridomas is well known. Immortal antibody-produdng 
cell lines can be created by cell fusfon, and also by ottier technkiues such as direct transformation of B lymphocytes 
witti oncogenic DNA, or tiBnsfection witti Epstein-Banr vims. See. e.g.. M. Schreier et al. (1980) HYBRIDOMA TECH- 
NIQUES; Hammerling et al. (1981), MONOCLONAL ANTIBODIES AND T-CELL HYBRIDOMAS; Kennetl et al. (1980) 

45 MONOCLONAL ANTIBODIES; ssa SSSQ, U.S. Patent Nos. 4,341,761; 4,399,121; 4.427,783; 4,444,887; 4,466.917; 
4.472.500; 4.491.632; and 4.493.890. Panels of monodonal antibodies produced against HCV epitopes can be 
screened for various properties; i.a, for isotype. epitope affinity, etc. 

[0098] Antfoodies, botti monodonal and polydonal. which are directed against HCV epitopes are particulariy useful 
in diagnosis, and those which are neutralizing are useful in passive immunotherapy. Monodonal antibodies, in particu- 

so lar, may be used to raise anti-kJiotype antOtxxfies. 

[0099] Anti-kifotype anttoodies are immunoglobulins which carry an Internal image" of the antigen of the infectious 
agent against whteh protection is desired. Technkiues for raising anti-idotype antibodes are known in ttie art See. e.g.. 
Grzych (1985). Nature aifi:74; MacNamara et al. (1984). Sdence 226:1325, Uytdehaag et al (1985), J. Immunol. 
134:1225. These anti-Wiotype antibodies may also be useful for treatment and/or diagnosis of NANBH, as well as for 

55 an eluddation Of tile immunogenic regions Of HCV antigens. 

[0100] Using ttie HCV/J1 polynudeotide sequences as a basis, oligomers of approximately 8 nudeotides or more can 
be prepared, eittier by exdsion or syrthetically, which hybrWize witti the HCV genome and are useful in Wentiffeation of 
ttie viral agent(s), f urttier characterization of ttie viral genome(s), as well as in detection of ttie virus(es) in diseased indi- 
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viduals. The probes for HCV polynucleotides (natural or derived) are a length which allows the detection of unique viral 
sequences ty hytxidization. While 6-8 nucleotides may t>e a workable length, sequences of atx)ut 10-12 nucleotides 
are prefen^ed, and atx)ut 20 nucleotides appears optimal. These probes can be prepared using routine methods, includ- 
ing automated oligonucleotide synthetic methods Among useful probes, for eocample, are the clones dsdosed herein, 
5 as well as the various oligomers useful in probing cDNA Ibraries. set forth below. A complement to any unique portion 
of the HCV genome will be satisfactory. For use as probes, complete complementarity is desirable, though it may be 
unnecessary as the length of the fragment is increased. 

[0101] For use of such probes as diagnostics, the biological sample to be analyzed, such as blood or serum, may be 
treated, if desired, to extract the nucleic acids contained therein. The resulting nucleic add from the sample may be sub- 

10 jected to gel electrophoresis or other size separation techniques; alternatively, the nucleic add sanple may be dot blot- 
ted without size separation. The probes are then labeled. Suitable labels, and methods for labeling probes are known 
in the art, and indude. for example, radioactive labels incorporated by nick translation or Wnasing. biotin. fluorescent 
probes, and chemiluminescent probes. The nucleic adds extracted from the sarrple are then treated with the labeled 
probe under hybridization conditions of suitable stringendes. Usually high stringency conditions are desirable in order 

IS to prevent false positives. The stringency of hybridization is determined by a number of factors during hybridization and 
during the washing procedure, induding temperature, ionic strength, length of time, and concentration of formamide. 
These factors are outiined in, for example, Maniatis. T (1982) MOLECULAR CLONING; A LABORATORY MANUAL 
(Cold Spring Harbor Press. CoM Spring Harbor, N.Y.). 

[0102] Generally, it is expected ttiat the' HCV genome sequences will be present in semm of infected individuals at 

20 relatively low levels, i.e., at approximalely 10^-10^ chimp infectious doses (CID) per ml. This level may require that 
amplif wation techniques be used in hybridization assays. Such techniques are known in the art For exarrple, the Enzo 
Biochemical Corporation "Bio-BrkJge" system uses terminal deoxynudeotide transferase to add unmodified 3'-poly-dT- 
tails to a DNA probe. The poly dT-tailed probe is hybridized to the target nudeotide sequence, and then to a biotin-mod- 
ified pdy-A. PCT Apjx No. 84/03520 and EPO Pub. Na 124,221 describe a DNA hybridization assay in which: (1) ana- 

2S lyte is annealed to a single-stranded DNA probe that is complementary to an enzyme-labeled oligonudeotide; and (2) 
the resulting tailed duplex is hybrWized to an enzyme-labeled oligonudeotide. EPO Pdb. No. 204,510 descri>es a DNA 
hybridization assay in which analyte DNA is contacted with a probe that has a tail, such as a poly-dT tail, an anplif ier 
strand that has a sequence that hybridizes to ttie tail of the probe, such as a poly-A sequence, and whteh is capable off 
binding a plurality of labeled strands. 

30 [0103] A particulariy desirable technique may first involve amplification of the target HCV sequences in sera approx- 
imately 10.000-lbW, i.e.. to approximately 10® sequences/ml. This may be acconrplished, for exanple, by the polymer- 
ase chain reactions (PGR) technique described which is by SaiW et al. (1986) Nature ^:163, Mullis, US. Patent No. 
4,683,195. and Mullis etal. U.S. Patent No. 4,683,202. The amplified sequence(s) may tiien be detected using a hybrid- 
ization assay which is described in co-pending European Publication Na 317-077 and Japanese application Na 63- 

35 260347, which are assigned to the herein assignee, and are hereby incorporated herein by referenca These hybridiza- 
tion assays, which shouU detect sequences at the level of 10^/ml. utilize nudeic add muftimers which bind to single- 
stranded analyte nudeic add, and which also bind to a muttiplfeity of single-stranded labeled oligonucleotides. A suita- 
ble solution phase sandwich ass^ which may be used with labeled pdynudeotide probes, and the methods for the 
preparation of probes is desaibed in EPO Pub. Na 225,807 which » hereby incorporated herein by reference. 

40 [01 04] The probes can be packaged into diagnostic kits. Diagnostic kits indude the probe DNA, which may be labeled; 
alternatively, the probe DNA may be unlabeled and the ingredients for labeling may be induded in the kit in separate 
containers. The kit may also contain ottier suitably packaged reagents and materials needed for the particular hybrkli- 
zation protocol, for example, standards, wash buffers, as well as instnjctions for conducting the test. 
[Oioq Botti the HCV/J1 polypeptides whteh read immunologfeally with serum containing HCV antaxxlies and ttie 

45 antibodies raised against ttie HCV specif k: epitopes in these polypeptides are useful in immunoassays to detect pres- 
ence of HCV antibodies, or ttie presence of ttie vinjs and/or viral antigens, in biok)gical samples. Design of the immu- 
noassays is subject to a great deal of variation, and a variety of ttiese are known in ttie art An immunoassay for anti- 
HCV antibody may utilize one viral epitope or several viral epitopes. When multiple epitopes are used, ttie epitopes may 
be derived from ttie same or different viral polypeptides, and may be in separate recombinant or natural pdypeptides, 

so or togettier in the same recombinant polypeptkf es. 

[0106] An immunoassay for viral antigen may use. for example, a monodonal antibody directed towards a viral 
epitope, a oontMnation off monodonal antibodies directed towards epitopes of one viral polypeptide, monodonal anti- 
bodies drected towards epitopes of different viral pdypeptides. polydonal antibodies directed towards the same viral 
antigen, polyclonal antibodies directed towards different viral antigens or a combination off monodonal and polyclonal 

55 antibodies. 

10107] Immunoassay protocols may be based, for example, upon competition, or dired reaction, or sandwich type 
assays. Protocols may also, for example, use sdkJ supports, or may be by immunoprecipitation. Most assays involve 
ttie use of labeled antftxxJy or pdypeptide. The labels may be. for example, fluorescent, chemiluminescent. radioactive. 
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or dye molecules. Assays which amplify the signals from the probe are also known. Examples of which are assays 
which utilize biotin and avidin, and enzyme-iabeled and mediated immunoassays, such as EUSA assays. 
[01 08] Typically, an imnrtunoassay for anti-HCV antibody will involve selecting and preparing the test sample, such as 
a biological sample, and then incubating it with an antigenic (i e., epitope-containing) HCV polypeptide mder concfitions 
ttiat allow antigen-antibody complexes to form. Such conditions are well known in the art In a heterogeneous fomiat. 
the polypeptide is bound to a solid sif3port to facilitate separation of the sample from tiie polypeptide after incubation. 
Examples of solid supports that can be used are nitrocellulose, in membrane or microtitor well form, polyvinylchtoride. 
in sheets or microtiter wells, polystyrene latex, in beads or miaotiter plates, polyvinylidine fluoride, known as Immobu- 
lon™, diazotized paper, nylon membranes, activated beads, and Protein A beads. Most preferably, the Dynatech. Immu- 
lon^ 1 mk^otiter plate or tiie 0.25Hnch polystyrene beads, which Spec finished by Predsion Plastic Ball, are used in 
tiie heterogeneous format The solid support is typk^lly washed after separating it from the test sample. In a homoge- 
neous format the test sample is incubated with antigen in solution, under conditions that will precipitate any antigen- 
antibody complexes ttiat are fbmied, as is know in the art. The precipitated complexes are then separated from the test 
sample, for example, by centrifugation. The complexes fbmied comprising anti-HCV antibody are then detected by any 
of a nurTt>er of techniques. Depending on the format the corrplexes can be detected with labeled anti-xenogeneic Ig 
or, if a competitive format is used, by measuring the anrx)unt of bourvl. labeled competing antibody. 
[0109] In immunoassays where HCV polypeptides are the analyte. the test sample, typically a biotogical sairple. is 
incubated with anti-HCV antft)0dies again under conditions that allow the formation of antigen-antikxxly corrplexes. Var- 
ious formats can be employed, such as a "sandwich" assay where antibody bound to a solid support is incubated witti 
the test sarrple; washed;incubated with a second, labeled antibody to the analyte; and the support is washed again. 
Analyte is detected by determining if the second antbody is bound to ttie support In a corrpetitive format which can 
be eittier heterogeneous or homogeneous, a test sample is usually incubated with and antibody and a lat>eled. corrpet- 
ing antigen either sequentially or sinrultaneously. These and ottier formats are well known in ttie art. 
[01 1 0] The Flavivirus model for HCV allows predictions regarding the likely tocation of diagnostic epitopes for the vir- 
ion structural proteins. The C. pre-M. M. and E domains are all likely to contain epitopes of significant potential for 
detecting viral antigens, arxl particulariy for cGagnosis. Similarly, domains of the rKsnsti'uctural proteins are expected to 
contain important diagnostic epitopes (e.g.. NS€ encoding a putative polymerase; and NS1 encoding a putative com- 
plement-binding antigen). Recombinant polypeptides, or viral polypeptides, which include epitopes from ttiese specific 
dontains may be useful for the detection of viral antibodies in infections blood donors and infected patients. In addition, 
antibodies drected against the E and/or M proteins can be used in immunoassays for the detection of viral antigens in 
patients witti HCV caused NANBH, and in infectious bk)od donors. Moreover, these antibodies may be extremely useful 
in detecting acute-phase donors and patients. 

[01 1 1 ] Antigenic regions of tiie putative potyprotein can be mapped and identified by screening ttie antigenicity of bac- 
terial expression products of HCV cDNAs which encode portions off tiie polyprotein. Ottier antigenk; regions of HCV 
may be detected by expressing ttie portions of ttie HCV cDNAS in ottier expression systems, including yeast systems 
and cellular systems derived from insects and vertebrates. In addition, studies giving rise to an antigenicity index and 
hydrophobidty/hydrophilidty profile give rise to information conceming ttie probability of a region's antigenicity. Effident 
detection systems may indude tiie use off panels off epitopes. The epitopes in ttie panel may be consbojcted into one or 
multiple polypeptidea 

[0112] Kits suitable for immunodiagnosis and containing the appropriate labeled reagents are constructed by pack- 
aging tiie appropriate materials, induding the polypeptides of ttie invention containing HCV epitopes or antibodies 
directed against HCV epitopes in suitable containers, along with ttie remaining reagents and materials required for ttie 
conduct of ttie assay (e.g.. wash buffers, detection means like labeled anti-human Ig. labeled anti-HCV. or labeled HCV 
antigen), as well as a suitable set of assay instiiictions. 

[0113] The HCV/J1 nudeotide sequence infamation described herein may be used to gain further information on ttie 
sequence of ttie HCV genomes, and for kJentification and isolation of additional HCV isolates related to J1 . This infor- 
mation, in turn, can lead to additional polynudeotide probes, polypeptides derived from tiie HCV genome, and antibod- 
ies directed against HCV epitopes which would be useful for ttie diagnosis and/or freatment of HCV caused NANBH. 
[0114] The HCV/J1 nudeotide sequence infbrrmtion herein is useful for ttie design of probes for the isolation of addi- 
tional sequences which are derived from as yet undefined regions of ttie HCV genomes from which the J1 sequences 
are derived. For example, labeled probes containing a sequence off approximately 8 or more nudeotides; and preferably 
20 or more nudeotides. which are derived from regions dose to ttie S'-termini or 3'-termini of the femily off HCV cDNA 
sequences disclosed in the examples may be used to isolate overiapping cDNA sequences from HCV cDNA libraries. 
These sequences which overiap the cDNAs in the akx>ve-mentioned clones, but which also contain sequences derived 
from regions of the genome from which ttie cDNA in ttie above mentioned dones are not derived, may ttien be used to 
synttiesize probes for identifrcation of other overiapping fragments which do not necessarily overiap ttie cDNAs 
desCTibed below. Methods tor constructing cDNA libraries are known in the art. See. ag. EPO Pub Na 318.216. It is 
particulariy prefen-ed to prepare libraries from the serum off Japanese and other Asian patients diagnosed as having 
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NANBH demonstrating antibody to HCV1 antigens; these are believed to be the most likely candidates for can-iers of 
HCV/J1 or related isolates. 

[01 1 5] HCV particles may be isolated from the sera from Individuals with NANBH or from cell cultures by any of the 
methods krrown in the art, including for example, techniques based on size discrimination such as sedimentation or 
s exclusion methods, or techniques based on density such as ultracentrifugation in density gradents. or precipitation with 
agents such as polyethylene glycol, or chromatography on a variety of materials such as anionic or cationic exchange 
materials, and materials which bind due to hydrophobictty. 

[01 1 6] A prefen-ed method of isolating HCV particles or antigen is by immunoaff inity columns. Techniques for Immu- 
noaff inity chromatography are known in the art including techniques for affixing antibodies to solid supports so that they 

10 retain their immunoselective activity. The technK|ues may be those in which the antibodies are adsorbed to the support 
(see. for example. Kurstak in ENZYME IMMUNODIAGNOSIS. page 31 -37). as well as those in virtiich the antibodies are 
covalently linked to the support Generally, the techniques are simtiar to those used for covalent linking of antigens to a 
solid support, desaibed above. However, spacer groi^ may be included in the bifunctional coupling agents so that the 
antigen binding site of the anttoody remains accessibl& The antibodies may be monoclonal, or polyclonal, and it may 

15 be desirable to purify the antibodies before their use in the immunoassay. 

[0117] The general techniques used in extracting the genome from a virus, preparing and probing a cOUA library, 
sequencing cfones, constructing expressfon vectors, transforming cells, performing immunological assays such as radi- 
oimmunoassays and ELISA assays, for growing cells in culture, and the like are known in the art and laboratory manu- 
als are available descrfoing these techniques. However, as a general guide, the following sets forth some sources 

20 currently available for such procedures, and for materials useful in carrying them out 

[0118] Both prokaryotk; and eukaryotic host cells may be used for expressfon of desired coding sequences when 
appropriate control sequences which are compatible with the designated host are used. Among prokaryotic hosts. E» 
cqU is most frequently used. Expression control sequences for prokaryotes include promoteiB, optionally containing 
operator portfons. and ribosome binding sites. Transfer vectors compatible with prokaryotk: hosts are commonly derived 

25 from, far example, pBR322. a plasmid containing cperons confening ampidllin and tetracycline resistance, and the var- 
ious pUC vectors, which also contain sequences conferring antibiotk: resistance mariners. These mari^ may be used 
to obtain successful transfonnants by selection. Comnx>nly used prokaryotic control sequences include the Beta-lacta- 
mase (penicillinase) and lactose promoter systems (Chang et al.(1977). Nature 1^:1056, the tryptophan (trp) pro- 
moter system (Goeddel et al. (1980) Nucleic Add Res. 8:4057). and the lambda^erived Pl promoter and N gene 

30 ribosome binding site (Shimatake €A al. (1981) Nature 2^:128) and the hybrid las promoter (De Boer et al. (1983) Proc. 
Natl. Acad. Sd. USA 23g;l28) derived from sequences of the tpa and !a£ UV5 promoters. The foregoing systems are 
particularly compatible with E^ cpli; if desired, other prokaryotk: hosts such as strains of Badllus or Pseudomonas may 
be used, with corresponding control sequences. 

[01 1 9] Eukaryotic hosts indude yeast and mammalian cells in culture systems. Saccharomyces cerflviaae. Saccha- 

35 romyces carisberqensls, Meteielalactis and E^iiapastod&are the most commonly used yeast hosts, and are conven- 
ient fungal hosts. Yeast compatible vectors carry martyrs which permit selection of successful transfonnants by 
conferring prototrophy to auxotrophic mutants or resistance to heavy metals on wiW-type strains. Yeast conpatiWe vec- 
tors nwy enrptoy the 2 micron origin of replication (Broach et al. (1983) Math Enz. 101 :307). the combination of CEN3 
and ARS1 or other means for assuring replication, such as sequences which will result in incorporation of an appropri- 

40 ate fragment into the host cell genoma Control sequences for yeast vectors are known in the art and indude promoters 
for the synthesis of glycolytic enzymes (Hess et al. (1968) J. Adv. Enzyme Eng. 7:149; Holland et al. (1978), J. Bfol. 
Chem. 2^:1385). induding the promoter for 3 phosphoglycerate kinase (Hitzeman (1980), J. Biol. Chem. 255:2073). 
Terminators may also be included, such as ttiose derived from the enolase gene (Hdland (1981). J. Biol. Chem. 
256:1385). Particulariy useful control systems are those which comprise the glyceraUehyde-3 phosphate dehydroge- 

45 nase (GAPDH) promoter or ateohol dehydrogenase (ADH) regulatable promoter, terminators also derived from 
QAPDH, and if secretion is desired, leader sequence from yeast alpha factor. In addition, the transcriptional regulatory 
region and the transcriptional initiation region which are operaWy linked may be such tttat they are not naturally assod- 
ated in the wild-type organism. These systems are desatoed in detail in EPO Pub. Na 1 20.551 ; EPO Pub. No. 1 1 6,201 ; 
and EPO Pub. Na 1 64,556 all of which are incorporated herein by reference. 

50 [0120] Mammalian cell lines available as hosts for es^ression are known in the art and indude many immortalized 
cell lines available from the American Type Culture Cdlection (ATCC), induding HeLa cells. Chinese hamster ovary 
(CHO) cells, baby hamster kidney (BHK) cells, and a number of other cell linea Suitable promoters for mammalian cells 
are also known in the art and include viral promoters such as that from Simian Virus 40 (SV40) (Rers (1978), Nature 
23:1 1 3). Rous sarcoma virus (RSV). adenovirus (ADV). and bovine papilfoma virus (BPV). Mammalian cells may also 

55 require tenminator sequences and poly A addition sequences: enhancer sequences which inaease expressfon may 
also be induded, and sequences which cause amplification of the gene may also be desirable. These sequences are 
known in the art Vectors suitable for replication in mammalian cells may include viral replicons. or sequences whfoh 
insure integration of the appropriate sequences encocfing NANBV epitopes into tiie host genome. 
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[0121] The vaccinia virus system can also be used to express foreign DNA in manmalian cells. To express heterolo- 
gous genes, the foreign DNA is usually inserted into the thymidine kinase gene of the vaccinia virus and then infected 
ceils can be selected. This procedure is known in the art and further information can be found in these references 
[Macketl et al. J. Virol. 42: 857-864 (1984) and Chapter 7 in DNA Cloning. Vol. 2, IRL Press]. 
[0122] In addition, virai antigens can be expressed in insect cells by the Baculovirus system. A general guide to bac- 
ulovirus expression by Summer and Smith is A Manual of Methods Ibr Baculo virus Vectors and Insect Cell Culture Pro- 
cetflireg (Texas Agricultural Experinient Station Bulletin Na 1555). To incorporate the heterologous gene into the 
Baculovirus genome the gene is first doned into a transfer vector containing some Baculovirus sequences. THis trans- 
fer vector, when it is cotransfected with wild-type virus into insect cells, will recombine with the wiW-type virus. Usually, 
the transfer vector will be engineered so that the heterotogous gene will disrupt the wild-type Baculovirus polyhedron 
gena This disruption enables easy selection of the recombinant virus since the cells infected with the recomtxnant virus 
will appear phenotypically different from the cells infected with the wiW-type viaia The purified recombinant virus can 
be used to infect ceils to express the heterologous gene. The foreign protein can be secreted into the medium if a signal 
peptide is linked in frame to the heterologous gene; othenwise. the protein will be bound in the ceil lysates. For further 
information, see Smitti ^ al Mol. & Cell. Bid. 32156-2165 (1983) or Luckow and Summers in Virology IZ: 31-39 (1989). 
[0123] Transformation may be by any known m^od for introducing polynucleotides into a host cell, including, for 
example packaging tiie polynucleotide in a virus and transducing a host cell with the virus, and by direct uptake of the 
polynudeotWa The transformation procedure used depends upon the host to be transformed. Bacterial transformation 
by direct uptake generally employs treatment with calcium or rubidium chloride (Cohen (1972). Proc. Natf. Acad. Sci. 
USA 6251 10; Maniatis et al. (1982), MOLECULAR CLONING; A LABORATORY MANUAL (CoW Spring Harbor Press. 
Cold Spring Harbor, N.Y). Yeast transfbrnration by direct uptake may be canied out using the method of Hinnen et al. 
(1978) Proc. Natf. Acad. Sd. USA Z5; 1929. Mammalian transformations by direct uptake may be conducted using the 
calcium phosphate precipitation method of Graham and Van der Eb (1 978), Virotogy 52:546 or the various known mod- 
ifications thereof. 

[01 24] Vector construction employs technk^ues which are known in the art Site-specific DNA deavage is performed 
by treating with suitable restriction enzymes under conditions which generally are specified by tiie manufacturer of 
these commerdally available enzymes. The deaved fragments may be separated using polyacrylamkie or agarose gel 
electrophoresis technkjues, aocortfing to the general procedures found in Methocte in Enzymotogy (1980) 65:499-560. 
Sticky ended deavage fragments may be blunt ended using E. coli DNA polymerase I (Wenow) in the presence of tiie 
appropriate deoxynudeotide triphosphates (dNTPs) present in ttie mixture. Treatment with SI nudease may also be 
used, resulting in the hydrolysis of any single stranded DNA portions. 

[0125] Ligations are carried out using standard buffer and temperature conditions using T4 DNA ligase and ATP; 
sticky end ligations require less ATP and less ligase than Uunt end ligations. When vedor fragments are used as part 
of a ligation mixture, the vector fragment is often treated with bacterial alkaline phosphatase (BAP) or calf intestinal 
alkaline phosphatase to remove the 5'-phosphate and thus prevent religation of the vector; alternatively, restriction 
enzyme cfigestion of unwanted fragments can be used to prevent ligation. Ligation mixtures are transformed into suita- 
ble doning hosts, such as E fidl and successful transfomiants selected by. for exarvple, antibiotic resistance, and 
screened for tiie correct construction. 

[01 26] Syrthetic oligonudeotides may be prepared using an automated oligonudeotkJe synthesizer as described by 
V\famer (1984), DNA 3:401. If desired, the synthetic strands may be labeled with ^P by treatment with polynudeotide 
kinase in the presence of ^P-ATP. using standard concfitions for the reaction. DNA sequences, induding those isolated 
from cDNA libraries, may be modified by known techniques, induding. for example site directed mutagenesis, as 
desaibed by Zoller (1982). Nucleic Ackls Res. ia:6487. 

[0127J DNA libraries may be probed using the procedure of Grunstein and Hogness (1975). Proc. hteti. Acad. Sd. 
USA za:3961. Briefly, in this procedure, tiie DNA to be probed is immobilized on nitrocellulose filters, denatured, and 
prehybridized vwth a buffer. The percentage of formamide in the buffer, as well as the time and temperature conditions 
of tfie prehybrWization and subsequent hybridization steps depends on the stringency required. Oligomeric probes 
whwh require tower stringency conditions are generally used with low percentages of formamkie. lower tenperatures, 
and longer hybricfization times. Probes containing more than 30 or 40 nudeotides such as those derived from cDNA or 
genomk: sequences generally employ higher temperatures, e.g.. about 40-42*C, and a high percentage. e.g., 50%, for- 
mamida Following prehybrkJization, 5'-^P-labeled oligonudeotide probe is added to the buffer, and the filters are incu- 
bated in this mixture under hybridization conditions. After washing, the treated filters are subjected to autoradiography 
to show the location of the hybricfized probe: DNA in corresponding locations on the original agar plates is used as tiie 
source of the desired DNA. 

[01 28] An enzyme-linked imnrujnosorbent assay (EUSA) can be used to measure ather antigen or anttoody concen- 
trations. Tills method depends upon conjugation of an enzyme to eittier an antigen or an antibody, and uses tiie bound 
enzyme activity as a quantitative label. To measure antibody, the known antigen is fixed to a soW phase (e.g., a micro- 
plate or plastic cup), incutwted with test serum dilutions, washed, incubated witii anti-immunoglobulin labeled witti an 
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enzyme, and washed again. Enzymes suitable for labeling are known in the art. and include, for example, horseradish 
peroxidasa Enzyme activity bound to the solid phase is measured by adding the specific substrate, and determining 
product formation or substrate utilization colorimetrically. The enzyme activity bound is a direct function of the amount 
of antibody bound. 

5 [0129] To measure antigen, a known specific antibody is fixed to the solid phase, the test material containing antigen 
is added, after an incubation the solid phase is washed, and a second enzyme-labeled antftxxly is added. After wash- 
ing, substrate is added, and enzyme activity Is estimated cotorimetrically. and related to antigen concentration. 

Examples 

10 

1 

[0130] This exanple descrtoes the cloning of the HCV/J1 and HCV/J7 nucleotide sequencea 

[0131] Both blood samples whfeh were used as a source of HCV virions were found to be positive in an anti-HCV 

IS antibody assay. THe HCV isolates from these samples were named HCV/J1 and HCV/J7. The infectivity of the blood 
sample containing the J1 isolate was confirmed by a prospective study of blood transfusion redpients. Dr. Tohru 
Katayama from the Department of Surgery at the National Tokyo Chest Hospital collected blood from patients who have 
contracted post-transfusion non-A, non-B hepatitis. He also collected bfood samples from the respective Wood donors 
of these patients. Next, these samples were assayed for antibodies to the ClOO-3 HCV1 antigen (EPO Pub. No. 

20 318.216), and blood from one of the donors was found to be positive. 

[01 32] Isolation of the Rf^ from the blood samples began by pelleting virions in the bfood sarrple by ultracentrifuga- 
tion [Bradley, D.W.. McCaustland, K.A.. Cook E.H.. Schable. C.A., Ebert, J.W. and Maynard. J.E. (1985) Gastroenterol- 
ogy 773-779]. RNA was then extracted from the pellet by the guanidinium/cesium chloride method [Maniatis T, 
Fritsch, E.F, and Sambrook J. (1982) "Molecular Cloning: A Laboratory Manual", CoW Spring Harbor Laboratory. Cold 

25 Spring Harbor] and further purified by phenol/chloroform extractfon in the presence off urea. [Beri<. A. J. Lee,F.. Han-lson. 
T. Williams. J. and Sharp, PA. (1 979) Cell 17. 935-944]. 

[01 33] Five pairs of synthetic oligonucleotide primers were designed from the C/E, E, E/NS1 . NS3, and NS5 domains 
of the nucleotide sequence of HCV1 to isolate fragments from the J1 and J7 genome. The first set of primers were to 
isolate the sequence from the core and some of the envelope domain. The second set of primers were to isolate the 
30 sequences in the envelope domain. The third set of primers were to isolate a fragment which overiapped the putative 
envelope and non-structural one, NS1 . domains. The fourth and fifth set of primers were used to isolate fragments from 
non-structural domains three and five, NS3 and NS5. The sequences for the various primers are shown befow: 
The sequence of the primers for the C/E region were: 

35 21 S 5' CQTGCCCCCGCAAQACTGCT 3' 
J80A 5' CCGTCCTCCAGAACCCGGAC 3' 

The sequence of the primers for the E region were: 

40 71 S 5' QCCQACCTCATGGGQTACAT3' 
J132A 5' AACTGCQACACCACTAAGGC 3' 

The sequence of the primers for the E/NS1 region were: 

45 127S5' TQGCATQGGATATGATQATG3' 
166A5' TTQAACTTGTQGTQATAQAA3' 

The sequence of the primers for the NS3 region were : 

so 464S5' QGCTATACCGGCGACTTCGA3' 
526A 5' QACATGCATGTCATGATGTA 3' 

The sequence of the primers for the NS5 region were: 

55 870S5' GCTGGAAAGAGGGTCTACTA3' 
917A5' GTTCTTACTGCCCAGTTGAA3' 

[0134] 1 ng of the antisense primers. 1 66A. 526A. or 91 7A. was added to 1 0 units of reverse transcriptase (Bforad) 
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to synthesize cONA fragments from the isolated RNA as the template. The cDNA fragments were then anplified b/ a 
standard polymerase chain reaction [SaiW. R.K., Scharf. S., F^oona. R, Mullls. K.B.. Horn G.T., Erlich. H.A.. and Arn- 
heim, N. (1985) Science 230, 1350-1354] after 1 jig of the appropriate sense primer, 21S. 71S, 127S, 464S or 870S. 
was added. 

[0135] The cDNA fragments amplified by the PGR method were gel isolated and cloned by blunt-end ligation into the 
Smal site of pUCi 19 [Vieira, J. and Messing, J. (1987) Methods in Enzymology 153, 3-1 1] or into the SnaBI site of cha- 
romid SB, a derivative of the cloning vector charomid 9-42 [Saitot I. and Stark, G. (1986) Proa Natl. Acad. Sd. USA 
8664-8668]. Clones which contain the fragments of the five viral domains were successfully constructed. 

U 

[0136] From the PGR reaction off the Japanese isolates, J1 and J7. three independent clones from each region. C/E, 
E, E/NS1, NS3. and NS5, have been sequenced by the dideoxy chain termination method. 
[0137] Sequence from all regions except C/E has been isolated from the J1 isolate. Sequence from only the C/E 
region has been isolated from the J7 isolate. Surprisingly, fragments isolated from both isolates are neither longer or 
shorter than what would be predicted from the HGV1 genome. However, there is heterogeneity between clones con- 
taining sequence from the same region. Consequently, a consensus sequence was constructed for each of the 
domains, C/E, E, E/NS1. NS3 and NS5. as shown respectively in Rgures 1 through 5. These differences may be 
explained as artifacts which occur randomly during the PGR amplification [SaiW, R.K., Scharf, S., Faloona, P., Muilis, 
K.B., Hom, G.T.. Eriich. H.A., and Amheim. N. (1985) Science 1350-1354). Another explanation is that more than 
one virus genome is present in the plasma of a single healthy carrier and that these genomes are heterogeneous at the 
nucleotide level. 

[0138] To clarify this point it was determined hew many of these nucleotide differences would lead to amino add 
changes, using the sequence from the NS3 domain of the J1 isolate as an example. Out of the five nudeotide differ- 
ences, three fall on the third position of the amino add codon and do not change the amino add sequenca Bath of the 
remaining two nudeotide changes fall on the first position of the amino add codon and generate amino add changes 
of threonine to alanine and proline to alanine, all of which are small, neutral amino add residues. Similarty, when ana- 
lyzing the nudeotide differences in other domains, many silent and conserved mutations are found. These results sug- 
gest that nudeotide sequences of the HGV genomes in the plasma of a single healthy donor are heterogeneous at the 
nudeotide level. 

[01 39] In addition, once the consensus sequences for each of the fragments were compiled each sequence was com- 
pared to the HCV1 isolate in Figures 6 through 10. In Figure 6 the fragment from the C/E region of the J7 isolate shows 
a 92.8%, 512/552, nudeotide and 97.4%, 150/154, amino acid homdogytothe HCV1 isolate. The fragment from the E 
domain of J1 shews a slightly lower nudeotide and amino add homdogy to HCV1 in Rgure 7 of 76.2% and 82.9%, 
respectively. The fragment from the J1 isdate which overtaps the envelope and non-structural one domains shows the 
lowest homology to HGV1. as seen in Rgure 8. where the J1 isolate has a 71.5% nudeotide homology and a 73.5% 
amino acid homdogy to HGV1 . Rgure 9 shows a comparison of the fragment from the NS3 domain of J1 to HGV1 . The 
homology between the nucleotides sequences is 79.8%, while the amino add homdogy between ttie isolates is quite 
high. 92.2% or 179/194 amino adds. Figure 10 shows the homology between ttie NS5 sequences from J1 and HGV1 . 
The sequences have a 84.3% nudeotide and 88.7% amino add homology. 

[0140] The vectors descrtoed in the examples above were deposited with ttie Patent Microorganism Depository, Fer- 
mentation Institute, Agency of Industrial Sdence and Technology at 1-3, Higashi 1<homeTsukuba-chi, Ibaragiken305, 
Japan, and will be maintained under ttie provisions of the Budapest Treaty. The accession numbers and dates off ttie 
deposit are listed below, on page 68. 

lU 

[0141] An HGV/J1 done, J1-1519, was isolated using the essentially the techniques described above. However, ttie 
primers used in the isolation were J159S and 199A. The sequences of ttie digomeric primers J159S and 199A, which 
follow. Were based upon those in J1 -121 6 and in HGV1 . 

J159S 5' ACT GCC CTG AAC TGC AAT GA 3' 
199A 5' AAT CCA GTTGAG TTG ATG GA 3' 

[0142] aone J1 -1 51 9 is comprised of an HGV cDNA sequence of 367 nudeotides which spans most of ttie 5'-half of 
the NS1 region and which overiaps ttie E-region done, J1-1216. by 31 nucleotides. Three independent 10 dones span- 
ning ttiis region were sequenced; ttie sequences in ttiis region obtained from ttie ttiree dones were identical. The 
sequence of the HGV cDNA in J1 -121 6 (shown in ttie figure as J1) and ttie amino acids encoded therein (shown above 
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the nucleotide sequence) are shown in Rgure 13. Figure 13 also shows the sequence differences between J1-1216 in 
the comparable region of the prototype HCV1 cDNA (indicated in the figure as PT). and the resulting changes in the 
encoded amino acids. The homology between the JM216 and HCV1 cONA is approxinfately 70% at the nucleotide 
level, and about 75% at the amino add level. 

[0143] A composite of the sequences from the putative core to NS1 region of the J 1 1solate is shown in Rgure 14; 
also shown in the figure are the amino acids encoded in the J1 sequence. The variation from the HCV1 prototype 
sequence is shown in the line t>elow the J1 nucleotide sequence; the dashed lines indicate homologous sequences. 
The nonhomologous amino acid encoded in the HCV1 prototype sequence is shown below the HCV1 nucleotide 
sequence 

[0144] Qoned material containing the J1/1519 HCV cDNA (psl-1519) has been maintained in DhlSa. and deposited 
with the Patent Microorganism Depository. 

\>L 

[0145] Several regions of the J1 isolate, including the C200-C100 region from the putative NS3-NS4 region (which 
encompasses the region encodng the 5-1-1 polypeptide In HSV1 (See EPO Pub. No 318,216). and the putative NS1 
- E region, were amplified using the PGR m^od. The C200-C100 region includes nucleotides 3799 to 5321 of the pro- 
totype HCV1. RNA was extracted as described above, except that extraction was with guanidinlum ttiiocyanate in the 
presence of Proteinase K and sodium dodecylsuHate (SDS) (Maniatis (1982). supra). The RNA was transcribed Into 
HCV cDNA by incubation in a 25 |il reaction comprised of 1 of each primer. 40 units of RNase inhibitor (RNASIN). 
5 units of AMV reverse transcriptase, and salts and buffer necessary for the reaction. Amplification of a segment of the 
HCV cDNA from the designated region was performed utilizing pairs of synthetic oligomer 16-mer primers. PCRanrpli- 
fication was accomplished in three rounds (PCR I. PCR II. and PCRIll). The second and ttiird rounds of PGR amplifica- 
tion (PGR II) utilized different sets of PGR primers; tiie first PGR reaction was diluted lO-fold and nultiple rounds of 
PGR amplification were carried out with the new primers, so that ultimately up to 50% of the products of the first PCR 
reaction (PGR 1) were reamplifled. The primers used for the amplification of the regions were ttie following. These prim- 
ers, with the exception of J1C200-3 which was derived from the J1 isolate sequence, were derived from tiie prototype 
HGV1 sequence. 

Primers for amplification of the "5-1-r region f rom NS3-NS4. 
[0146] 
PCR I 

511/1 6A (sense, derived from nucleotides starting at number 1528 of HCV1) 

5' AAC AGG CTQ CGT GOT C 3* 
511/1 6B (anti-sense, derived from nucleotides ending at 5260 of HCV1) 

5* ACT TGG TCT GGA GAG C 3' 
PCR II 

511/35A (sense, the HCV portion derived from nucleotides starting at number 5057 of HCV1. tiie restriction 
enzyme site Is underiined) 

5* CHQAAnC TGG TCT TGT CGG GGA AGG GGQ GAA TG 3* 

511/35P (anti-sense, ttie HCV portion derived from nucleotides encfing at number 5233 of HCV1 ; the restriction 
enzyme site is urxleriined) 

5' CTTGAATTC GGT CTG CGT GAG GGG ACG CGG TGT GG 3' 
PGRlll 

511fi5A(see supra) 

VSNrc7 (antisense, derived from nucleotides ending at number 5804 of HCV1) 
5' GTA GTG CGT GGQ GGA AAC AT 3* 
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Primers for amplification of the "NS1/E* reoion 
[0147] 
5 PGR I 

JlIESia (sense, the HCV portion derived from nucleotides starting at number 953 of HCV1 . the restriction enzyme 
srte is underiined) 

5* CTTAGAATTC TGG CAT GGG ATA TGA TGA TG 3' 

10 

JIIEH (sense, the HCV portion derived from nucleotides starting at number 1087 of HCV1 , the restriction enzyme 
site is underiined) 

5* CIiaQA£aiC TCC ATG GTQ GGG AAC TGG GC 3' 

15 

Jlrcl2 (anti-sense, the HCV portion derived from nucleotides ending at 1995 of HCV1 . the restriction enzyme site 
is underiined) 

5- CnOAAEIS TAA COG GCT GAG CTC GGA 3* 

20 

sUffiia (anti-sense, the HCV portion derived from nucleotides ending at 1941 of HCV1 . the restriction enzyme site 
is underiined) 

5' CTTAGAATTC CGT CCA GTT GCA GGC AGC TTC 3' 

25 

PCR II 

lUlSLlS (see supra) 

JllZil (sense, the HCV portion is derived from nucleotides starting at number 1641 of HCVI. the restriction 
enzyme site is underlined) 

30 

5' CnQMHC CAA CTG GTT CGQ CTQ TAG A 3' 

(sense, the HCV portion is derived from nucleotides starting at number 1596 of HCVI. the restriction 
enzyme site is underlined) 

35 

5' TGA GAC GGA CGT GCT GCT CCT 3' 
Primers for the C20Q-C100 reoion of the "NSa-NSA" re^lnn 
40 [0148] 

PCR I 

^1g2Q0-1 (sense, derived from nucleotides starting at number 3478 of HCVI) 
45 5'TCCTACTrGAAAGGCTC3' 

JICZOQ-a (anti-sense, derived from nucleotides ending at number 4402 of HCVI) 
5' GGA TCC AAQ CTG AAA TCG AC 3' 

50 

JlT^ (anti-sense, the HCV portion derived from nucleotides ending at 5853 of HCVI . the restriction enzyme site 
is underlined) 

5' Cn^SMEIC GAG GCT GCT GAG ATA GGC AGT 3* 

55 

SllZlM (see above). 
PCR II 

J1C 2 QQ' 2 (sense, the HCV portion derived from nucleotides starting at number 3557 of HCVI. the restriction 
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enzyme site e underlined) 

y CTTGAATTC CCC GTQ GAQ TQQ CJA Afifi TfV^ ap.t 

J1C2QQ-4 (anti-sense, the HCV portion derived from nucleotides ending at 4346 of HCV1. the restriction enzyme 
site is underlined) 

5' CrrgAATTC TCG AAG TCQ CCQ QTA tag CCQ QTC ATQ 3' 
SUZaSA (see above) 

JlroSI (anti-sense, the HCV portion derived from nucleotides ending at 5826 of HCV1 , the restriction enzyme site 
is underlined) 

5' CTTAGAATTC GGC AGC TGC ATC GOT CTC CGG CAC 3' 



The amplified HCV cDNAs were either sequenced directly without cloning, and/a were cloned. Sequencing 
was accomplished using an assymotric PCR technique, essentially as described in Shyamala and Ames. J. Bacte- 
riology 121:1602 (1989). In this technique, amplification of the cDNA is carried out with a limiting concentration of 
one of the primers (usually in a ratio of about 1 :50) in order to get preferential amplification of one strand. The pref- 
erentially amplified strand is then sequenced by the dideoxy chain tenrtination method. 

The primers used for assymetric sequencing by the PCR method were the following. For the NS1 region: J1 IZ- 
1 and J1rc13 (sequenced with both): J1I2-2. Jlrc13 (confirmed on both strands). For the NS3-NS4 region, which 
includes the C200-C100 N-terminal region. C200-C100 C-terminal region, and the 5-1-1 region: J1C200-2 and 
J1C200-7 (for the N-terminal region of C200-C100). and J1C200-4 and J1C200-6 (for the C200-C100 C-terminal 
region); and 51 1/35A and hep 4 (for the 5-1-1 region). The sequences for J1C200-2. J1 C200-4. and 51 1/35A are 
shown supra; the sequences of hep 4, J1 C200-6. and J1 C200-7 are the following, 
hep 4 (derived from nucleotides starting at number 5415 of HCV1) 

5' TT GGC TAG TGG TTA GTG GGC TGG TGA CAG 3' 

J1C200-g (the HCV portion derived from nucleotides starting at number 3875 of HCV1 . the restriction enzyme site 
is underiined) 

5' CTTGAATTC CGT ACT CCA CCT ACQ GC A AGT TCC TT 3' 

J1CgQQ-7 (the HCV portion derived from nucleotides starting at number 3946 of HCV1 . the restriction enzyme site 
is underiined) 

5' CTTGAATTC GTG GCA TCC GTG GAG TGG CAC TCG TC 3' 

[0149] The sequences obtained by assymetric sequendng of the "NSr region, the C200-C100 region, and the 5-1- 
1 region are shown in Figure 15. and Figure 16. respectively. In the figures, the amino acids encoded in the J1 
sequence are shown above the J1 nucleotide sequence. The differences between the J1 sequence and the HCVI pro- 
totype nucleotide sequence is shown below the J1 sequence (the dashes indicate homologous nucleotides in both 
sequences). The encoded amino acids which differ in the HCVI prototype sequence are shown below the HCVI nucle- 
otide sequence. 

10150] HCV cDNAs from the NS1 region, the C200-C100 region, and the 5-1-1 region were cloned. A 300 bp and a 
230 bp fragment from the putative NS1 region, were cloned into a derivative of the commercially available vector. 
PGEM-3Z. in host HB101. and deposited with the ATCC as AW-300bp. The derivative vectors maintain the original 
PGEM-3Z pdyllnkers. an intact Amp' gene, and tiie genes required for replication in E. cqH. The HCV cDNA fragments 
may be removed with Sacl and Xbal. HCV cDNAs containing 770 bp N-terminal fragments of C200 were cloned into 
pMI E in HB1 01 , 12 clones were pooled and deposited with the ATCC as AW-770bp-N; the HCV cONA may be removed 
from the vector with Haell. The resultant Haell fragment will contain vecta DNA of 300 bp and 250 bp at the 5' and 3' 
encte, respectively. HCV cDNAs containing 700 bp C-temtinal fragments of C200 (AW-700bp-C) were cloned into 
M13mp10 and maintained in host DH5a-P; cloning was into ttie vector polylinker sita The resultant phage were 
pooled, and deposited with the ATCC on September 11. 1990 as AW.700bp-N or AW-700bp-C. HCV cDNA from J1 
equivalent to the 5-1-1 region of HCVI was cloned into mp19EcoR1 site, and maintained in DH5a-P. Several m13 
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phage superanants from thte cloning were pooled and deposited with the ATCC as J1 5-1 -1 , on Septenft>er 11.1 990. 
The HCV cONAs may be obtained from the phage by treatment with EcoRI. Accession numbers for J1 5-1-1 and AW- 
700bp-N or AW-700bp-C may be obtained 1:^ telephoning the ATCC at (301) 881 -2600. 
[0151] The abGve-descrOt}ed doned material was deposited with the American Type Culture Collection (ATCC). 

y 

[0152] An HCV cDNA library containing sequences of the putative 'NS1' region of the J1 isolate was created by direc- 
tional cloning In A,-gt22. The "NSr region extends from about nucleotide 1460 to about nucleotide 2730 using the num- 
bering system of the HCVI prototype nucleic add sequence, where nucleotide 1 is the first nucleotide of the initiating 
methionine codon for the putative polyprotein. The doning was accomplished using essentially the method described 
by Han and Rutter In GENETIC ENGINEERINQ. Vol 10 (J.K. Setlow. Ed.. Plenum Piialishing Co.. 1988), except that 
the primers for the synthesis of the first and second strand of HCV cDNA were JHC67 and JHC68, respectively, and the 
source of RNA was the J1 plasma. In this method the RNA Is extracted with guanicfium thiocyanate at a low tempera- 
ture. The RNA is then converted to full length cDNA. which is cloned in a defined orientation relative to the pro- 
moter in X-phaga Using this method, the HCV cDNAs to J1 RNA were inserted into the NotI site of X-gt22. The 
presence of "NS1 " sequences In the Ibrary was detected using as probe. Alx54. 

[0153] The sequence of a region of "NSr downstream from the region shown in Rgure 14, but which overlaps the 
region by about 20 nudeotides. was determined using the assymetric sequendng technique described above, but sub- 
stituting as primers for PCR amplification. Alx 61 and Alx 62. The resulting sequence is shown in Figure 17. (It should 
be noted that the PCR ampllTication was of a region from about nudeotide 1930 to about nudeotide 2340; this region 
is also encompassed in the sequence shown in Rgure 15). The sequences of the primers and probes used to obtain 
the HCV cDNA library in X-gt22. and to sequence the portion of the *NS1 " region were the following. 

JHC67 

5' GACGC QQCCG CCTCC QTQTC CAGCQ CGT 3' 
JHC68 

5' CGTQC QGCCQ CAAQA CTGCT AGCCQ AQQT 3' 
ALX 61 

5' ACCTQ CCACT QTQTA GTGQT CAGCA GTAAC 3' 
ALX 62 

5' ACGGA CGTCT TCGTC CTTAACAATA CCAGG 3' 
ALX 54 

5' GAACT TTGCQ ATCTG GAAGACAQGG ACAGQ 3' 

[0154] A 400 bp fragment of J1 HCV cDNA derived from the sequenced region was doned into pGEM3z and main- 
tained in HB101 : the HCV cDNA may be removed from the vector with SacI and Xbal. Host cells transformed with ttie 
vector (JH-400bp) have been deposited with tiie ATCC. 

[0155] A pooled cDNA Itorary was created from the J1 serum; the pooled library spans the J1 genome and is identified 
as HCV-J1 k glt22. The pooled cDNA library was created by pooling aliquots of 1 1 individual cDNA libraries, whidi had 
been prepared using the directional cloning technique described above, except that the libraries were aeated from 
primers which were designed to yield HCV cDNAs which spanned the genome. The primers were derived from the 
sequence of HCVI , and induded JHC 67 and JHC 68. The HCV cDNAs were inserted into the NotI site of X-gt22. The 
pooled cDNA ISwary. HCV-J1 X glt22, has been deposited with the ATCC. 

yi 

[0156] The sequence of a region of the polynudeotide upstream of ttiat shown in Figure 14 was determined. This 
region begins at nudeotide -267 with respect to the HCV-1 (See Rgure 1 2) and extends for 560 nudeotides. Sequenc- 
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ing was accompfished by preparing HCV cDNA from RNA extracted from J1 serum, and amplifying the HCV cDNA 
using the PGR method. 

[0157] RNA was extracted from 100 ^1 of serum following treatment with proteinase K and sodium dodecylsultete 
(SDS). The sanples were extracted with phenol-cNoroform, and the RNA precipitated with ethanol. 

5 [0158] HCV cDNA from the J1 isolate was prepared by denaturing the precipitated RNA with 0.01 M MeHgOH; after 
ten minutes at room temperature, 2-mercaptoethanol was added to sequester the mercury ions. Immediately, the mix 
for the first strand of cDNA synthesis was added, and incubation was continued tor 1 hr at 37*»C. The conditions for the 
synthesis of the anti-sense strand were the following: 50 mM Tris HCI. pH 8.3. 75 mM KO. 3 mM MgOa. 10 mM dithio- 
threitol. SOOfiM each deoxynudeotide trphosphate. 250 pmo\ specific antisense cDNA primer r25. 250 units MMLV 

10 reverse transcr?rtasa In order to synthesize the second strand (sense), the synthesis reaction conponents were 
added, and incubated for one hour at 1 4*C. The corrponents for the second strand reaction were as follows: 1 4 mM Tris 
HCI, pH 8.3, 68 mM KQ. 7.5 mM ammonium sulfate. 3.5 mM MgC^. 2.8 mM ditWothreitol, 25 units DNA polymerase I. 
and one unit RNase H. The reactions were terminated by heating the samples to 95"C for 1 0 minutes, followed by cool- 
ing on ica 

15 [0159] The HCV cDNA was amplified by two rounds of PCR. The first round was accomplished using 20 jJ of the 
cDNA mix The conditions for the PCR reaction were as follows: 10 mM Tris HQ. pH 8.3. 50 mM KCI, 1.5 mM MgC^, 
0.002% gelating, 200 mM each of the deoxynudeotide triphosphates, and 2.5 units Amplilaq. The PCR thermal cyde 
was as follows: 94«C one minute, 50'*C one minute. 72*»C one minute, repeated 40 times followed by seven minutes at 
72*»C. The second round of PCR was accomplished using nested primers (i.a primers which bound to an internal region 

20 of the first round of PCR amplified product) to increase the spedficrty of the PCR products. One percent of the first PCR 
reactfon was anplified essentially as the first round, except that the primers were substituted, and the second step in 
the PCR reaction was at 60*C instead of 50"C. The primers used for the first round of PCR were ALX90 and r14. The 
primers used for the second round of PCR were r14 and pi 4. 

[01 60] The sequences of the primers for the synthesis of HCV cDNA and fa the PCR method were the following. 

25 

r25 

5' ACCTTACCCAAATTQCQCGACCTA3' 
30 ALX90 

5' CCA TGA ATC ACT CCC CTG TGA GGA ACT A 3' 

r14 

35 

5' GGG CCC CCAG CTA QGC CGA GA 3' 

p14 

40 5'AACTACTGTCTTCACGCAGAAAGC3' 

[0161] The PCR products were gel purified, the material which migrated as having about 615 bp was isolated, and 
sequenced by a modification of the Sanger dideoxy chain termination method, using ^^P-ATP as label. In the modified 
method, the sequence replication was primed using P32 and R31 as primers: the double stranded DNA was melted for 
45 3 minutes at 95*'C prior to replication, and the synthesis of labeled dideoxy tenninated polynudeotides was catalyzed 
by Bst polymerase (obtained from BioRad Corp.). according to the manufacturer's directions. The sequencing was per- 
famed using 500ng to 1 jig of PCR product per sequendng reaction. 

[0162] The primers P32 (sense) and R31 (antisense) were derived from nudeotides -137 to -1 15 and from nude- 
otides 192 to 173. respectively, of the HCV1 sequence. The sequences of the primers are the following. 

so 

P32 primer 

5' AAC CCG CTC AAT GCC TGG AG A TT 3' 
55 R31 primer 

5' GGC CGX CGA GCC TTG GGG AT 3' 
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whereXo AorQ 

[0163] The sequence of the region in the J1 isolate which encompasses the 5'-untranslated region as well as a part 
of the region of the putative "Core" is shown in Rgure 18. In the figure, amino acids encoded in the J1 sequence are 
shown above the nucleotide sequence. Tlie sequence of the prototype HCVI is shown below the J1 sequence; the 
dashes indicate sequence homology with J1. The differing amino acids encoded in the HCVI sequence are shown 
below the HCVI sequence. 

[0164] An HCV cONA fragment which is a representative of the 600 bp J1 sequence described above (TC 600bp) 
was cloned into pQEM3Zand maintained in host HB101 ; the HCV cDNA fragment maybe removed with Sad and Xbal. 
TTiis material is on deposit with the ATCC. 

[01 65] Patent Microorganism Depository-deposited under Budapest Treaty terms. 



Deoosrted Materials 


Accession Nun*er 




E^DH5/k>S1 -8791a 


BP-2593 


9/15/1989 


(This clone contains 427 bp of the HS5 domain of J1) 


E^HB101/|?U1-1216c 


BP-2594 


9/15/1989 


CThis done contains 351 bp of the E/NS1 domains of J1) 


E^HB101/|pU1-4652d 


BP-2595 


9/15/1989 


(This done contains 583 bp of the NS3 domain of J1) 


&fifi!iDH5ayk>Sl-713c 


BP-2637 


11/1/1989 


(This done contains 580 bp of the E domain of J1) 


E. CQli DH5ayk)S7-28c 1 


BP-2638 


11/1/1989 


(This done contains 552 bp of the C/E domain of J7) 


I^DH5a/psM519 


BP3081 


8/30/90 



[0166] The following vectors described in the Examples were deposited with the American Type Culture Collection 
(ATCC), 12301 ParWawn Dr., Rockville, Maryland 20852, and have been assigned the following Accession Nun*)efs. 
The deposits were wade under the tenns of the Budapest Treaty. 



Deposited MRteri^l? 


Accession Number 


Deoosit Date 


TC-600BP (in E. poli HB10iyk5GEM3Z) 


68393 


9/11/90 


JH-400bp (in E. coli HB101/|pGEM32) 


68394 


9/11/90 


AW-300bp (in E. coli HB101/|dGEM3Z) 


68392 


9/11/90 


AW-770bp-N (in HB10iy|3M1E) 


68395 


9/11/90 


AW-700bp-C or AW-700bp-N (in E. coli DH5a-P/M13mp10) 






J1 5-1-1 (inE^DH5a-F/M13mp10) 






HCV-J1Xgt22 


40884 


9/6/90 



[0167] These deposits are provided for the convenience of those skilled in the art These deposits are neither an 
admission that such deposits are required to practice the present invention nor that equivalent embodiments are not 
within the skill of the art in view of the present disdosure. The public availability of these deposits is not a grant of a 
license to make, use or sell the deposited materials under this or any other patent The nucleic add sequences of the 
deposited materials are incorporated In to present disclosure by reference, and are controlling if in conflict with any 
sequences described herein. 

[0168] While the present invention has been described by way specific exanples for the benefit of those in the field. 
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the scope off the invention is not limited as additional embodiments will be apparent to those off skill in the art from the 
present disclosure. 

Claims 

5 

Claims fbr the following Contracting States : AT, BE, CH, U, DE, DK, FR, GB, GR, IT, LU, NL, SE 

1 . A polynucleotide in substantially isolated form conrprising a nucleotide sequence off at least 1 5 nucleotides from a 
J-1 HCV isolate, said J-1 HCV isolate having at least 90% nucleotide sequence homology with the J-1 sequence 

10 of any one of Rgures 7 to 10 or13 to 18. wherein said nucleotide sequence of at least 15 nucleotides is distinct 
from the nucleotide sequence of HCV isolate HCV-1 

2. A polynucleotide according to daim 1 wherein the J-1 HCV isolate has at least 95% homology witti the J-1 HCV 
sequence of any one of Rgures 7 to 10 or 13 to 18. 

75 

3. A polynucleotide accord ng to daim 1 wherein ttie J-1 HCV isolate has 1 00% homology with the J-1 HCV sequence 
of any one off Rgures 7 to 10 or 13 to 18. 

4. A polynucleotide according to any one off the preceding claims which comprises at least 20 nucleotides. 

20 

5. A method of detecting HCV polynucleotides In a test sample conrprising: 

(a) providing a polynucleotide as defined in any one of claims 1 to 4 as a probe; 

(b) contacting the test sample and the probe under conditions that allow fa the Ibmiation of a polynucleotide 
2S duplex between the probe and its complement in the absence off substantial polynucleotide duplex formation 

between the probe and non-HCV polynucleotide sequences present in test sample; and 

(c) detecting any polynucleotide duplexes comprising the probe. 

6. A polynucleotide comprising a sequence off at least 1 5 nucleotides ffrom a J-1 HCV isolate present in any one off 
30 plasmids pS1-8791a. bU1-1216c, bU1-4652d, pS1-713c pSl-1519. TC-600BP. JH-400BP, AW300BR AW.770- 

BP-N, or J1 5-1-1 deposited under accession numbers BP-2593, BP-2594. BP-2595, BP-2637, BP-3081, 68393. 
68394. 68392. 68395 and 40884 respectively, wherein said nucleotide sequence of at least 15 nucleotides Is dis- 
tinct from tfie nucleotide sequence of HCV isolate HCV-1 . 

35 7. A purified polypeptide comprising an amino acid sequence which: 

(a) is encoded by a nucleotide sequence as deffined in any one of claims 1 to 4 or in the HCV sequences 
deposited and defined in claim 6, said coding being in frame with the con-esponding amino acid sequences set 
out in Rgures 7 to 10 or 13 to 18; 
^ (b) conprises an antigenic detenninant; and 

(c) is distinct from the sequence off the polypeptides encoded by the HCV isolate HCV-1 

8. A polypeptide according to claim 7 which comprises at least 1 0 amino adds. 

45 9. A polypeptide according to claim 7 which comprises at least 1 5 amino adds. 

1 0. A polypeptide according to any one of daims 7 to 9 immobilised on a solid support 

1 1. An immunoassay for detecting the presence of anti-HCV embodies in a test sample which conprises: 

so 

(a) incubating the test sample under conditions that allow the formation of an antigen-antibody complex to be 
formed with a polypeptide as defined in any one of claims 7 to 10. wherein the polypeptide is not immunologi- 
cally cross-reactive with HCV-1 ; and 

(b) detecting any antigen antibody complexes formed. 

55 

12- An immunoassay according to claim 1 1 wherein the test sample comprises human Wood or a fraction ttiereof. 
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Claims for the following Contracting State : ES 

1 . A method of preparing a polynucleotide in substantially isolated form which comprises a nucleotide sequence of at 
least 15 nucleotides from a J-1 HCV isolate, said J-1 HCV isolate having at least 90% nucleotide sequence homol- 
ogy with the J-1 sequence of any one of Figures 7 to 1 0 orl 3 to 18, wherein said nucleotide sequence of at least 
1 5 nucleotides is distinct from the nucleotide sequence of HCV isolate HCV-1 . said method comprising: 

a) chemical synthesis by methods known perse; or 

b) DNA replication by methods known perse; or 

c) transaiption or reverse transcription by methods known per se; or 

d) restrictran enzyme digestion and ligation into a vecta by recombinant DNA techniques known per se. 

2. A method of preparing a polypeptide comprising an amino acid sequence which is encoded by a nucleotide 
sequence as defined in daim 1 . said method comprising: 

(0 chemical synthesis by methods known per se; or 

Gi) translation from the corresponding nucleic acid sequence by m^ods known per se; or 

(iii) expression of a recombinant expression system comprising the nucleotide sequence encoding the 

polypeptide by methods known per se; or 

Civ) isolation from virus by methods known perse. 

3. A method of detecting HCV polynucleotides in a test sample corrprising: 

(a) providing a polynucleotide as defined in claim 1 as a probe: 

(b) contacting the test sanple and the probe under conditions that allow for the tormation of a polynucleotide 
duplex between the probe emd its complement in the ak)sence of substantial polynucleotide diplex fbrmatbn 
between the probe and non-HCV polyriucleotide sequences present in test sample: and 

(c) detecting any polynucleotide dL43lexes comprising the probe. 

4. An immunoassay for detecting the presence of anti-HCV antftxxlies in a test sample which conprises: 

(a) incubating the test sample under conditions that allow the formation of an antigen-antbody complex to be 
formed with a polypeptide which is encoded by a nucleotide sequence as defined in daim 1 or a sequence of 
at least 15 nudeotides from a J-1 HCV isolate present in any one of plasmids pS1-8791a, bUM216c. bUl- 
4652d, pS1-713a pSM519, TC-600BR JH-400BR AW-300BP, AW-770-BP-N or J1 5-1-1 deposited under 
accession numbers BP.2593. BP-2594. BP-2595, BP-2637. BP-3081. 68393, 68394. 68392, 68395 and 
40884 respectively, wherein said nudeotide sequence of at least 1 5 nudeotides is distinct from the nudeotide 
sequence of HCV isolate HCV-1 , saki coding being in frame witti the corresponding amino add sequences set 
out in Rgures 7 to 10 or 13 to 18, whteh polypeptide comprises an antigenic determinant and is distinct from 
the sequence of the polypeptide encoded by the HCV isdate HCV-1 and is not immunologically aoss-reactive 
witti HCV-1; and 

(b) detecting any antigen antftxxly complexes formed. 

5. An immunoassay according to daim 6 wherein tiie test sample comprises human blood or a fraction ttiereof. 

6. A pdynudeotide in substantially isolated form corrprising a nudeotide sequence of at least 1 5 nudeotides from a 
J-1 HCV isolate, said J-1 HCV isolate having at least 90% nudeotide sequence homology with the J-1 sequence 
of any one of Rgures 7 to 10 or 13 to 18. wherein said nudeotide sequence of at least 15 nudeotides is distinct 
from the nucleotide sequence of HCV isolate HCV-1 . 

7. A polynudeotide according to daim 6 wherein the J-1 HCV isolate has at least 95% homology with ttie J-1 HCV 
sequence of any one of Rgures 7 to 10 or 13 to 18. 

8. A polynudeotide accorcfing to claim 6 wherein ttie J-1 HCV isdate has 100% homology witii ttie J-1 HCV sequence 
of any one of Rgures 7 to 10 or 13 to 18. 

9. A polynudeotide according to any one of claims 6 to 8 which conprises at least 20 nudeotides. 
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10. A polynucleotide comprising a sequence of at least 15 nucleotides from a J-1 HC V isolate present in any one of 
plasmlds pS1-8791a, bUM216c, bU1-4652d. pS1-713c pSM519. TC-600BPt JH-400BP. AW-300BP, AW-770- 
BP-N or J1 5-1-1 deposited under accession numbers BP-2593, BP-2594. BP-2595. BP-2637. BP-3081, 68393, 
68394, 68392. 68395 and 40884 respectivefy, wherein said nucleotide sequence of at least 15 nucleotides is dis- 
tinct from the nucleotide sequence of HCV isolate HCV-1 . 

11 . A purified polypeptide comprising an amino add sequence which: 

(a) is encoded by a nucleotide sequence as defined in any one of claims 6 to 9 or in the HCV sequences 
deposited and defined in claim 10, said coding being in frame with the corresporx^ing amino acid sequences 
set out in Figures 7 to 10 or 13 to 18; 

(b) comprises an antigenic determinant; and 

(c) is (fistinct from the sequence of the polypeptide encoded by the HCV isolate HCV-1 . 

12. A polypeptide according to claim 1 1 which comprises at least 10 amino acids. 

13. A polypeptide according to claim 1 1 which comprises at least 15 amino adds. 

14. A polypeptide according to any one of daims 1 1 to 13 immobilised on a solid support 
PatentansprOche 

PatentansprOche f Or fdgende Vertragsstaaten : AT, BE, CH, U, DE, DK, FR, GB, GR, IT, LU, NL, SE 

1. Polynudeotid in im wesentlichen isolierter Form, umfassend eine Nucleotidsequenz aus mindestens 15 Nudeoti- 
den aus einem J-1 HCV-lsdat wobei das J-1 HCV-lsolat eine mindestens 90%ige Nucleotidsequenzhomologie mit 
der J-1-Sequenz einer der Figuren 7 bis 10 Oder 13 bte 18 besitzt. wobei sich die Nudeotidsequenz aus minde- 
stens 15 Nudeotiden von der Nudeotidsequenz des HCV-lsolats HCV-1, wie in Rgur 12 gezeigt unterscheidet 

2. Polynudeotid nach Anspruch 1 , wobei das J-1 HCV-lsolat eine mindestens 95%ige Homologie mit der J-1 HCV- 
Sequenz einer der Rguren 7 bis 10 Oder 13 bis 18 besitzt. 

3. Polynudeotid nach Anspruch 1, wobei das J-1 HCV-lsdat eine 100%ige Homdogie mit der J-1 HCV-Sequenz 
einer der Figuren 7 bis 1 0 Oder 1 3 bis 1 8 besitzt 

4. Polynudeotid nach einem der vorstehende AnsprQche. das mindestens 20 Nudeotide umfaBt. 

5. Verfahren zum Nachweis von HCV-Pdynudeotiden in einer Testprobe, umfcissend: 

(a) Bereitstellen eines Polynudeotids, wie in einem der AnsprQche 1 bis 4definiert. als Sonde, 

(b) In-Kontakt-Bringen der Testprobe und der Sonde unter Bedingungen. cfie die Bildung von Polynudeotid- 
Doppelstrangen zwischen der Sonde und deren Kbirplementflrsfrang in Abwesenheit einer wesentlichen 
Polynudeotid-Doppelstrangbildung zwischen der Sonde und NIcht-HCV-Polynudeotidsequenzen, die in der 
Testprobe vorhanden sind, eriauben; und 

(c) Nachweis von alien Pdynudeotid-Doppelstrangen, die cGe Sonde umfassen. 

6. Polynudeotid, umfassend eine Sequenz aus mindestens 15 Nudeotiden aus einem J-1 HCV-lsolat das in einem 
der Plasmide pS1-8791a. bUM216c, bU1-4652d, pS1-713c pSM519, TC-600BP. JH-400BP, AW-300BP. AW- 
770-BP-N Oder J1 5-1-1 hinteriegt unter den Hintertegungsnummem BP-2593. BP-2594, BP-2595. BP-2637. BP- 
3081. 68393. 68394, 68392, 68395 bzw. 40884 vorhanden ist. wobei sich die Nudeotidsequenz aus mindestens 
15 Nudeotiden von der Nudeotidsequenz des HCV-lsolats HCV-1 unterscheidet 

7. Gereinigtes Polypeptid, umfassend eine Aminosduresequenz. die 

(a) von einer Nudeotidsequenz. die in einem der AnsprOche 1 bis 4 def inlert ist, Oder von den hinteriegten und 
in Anspruch 6 definierten HCV-Sequenzen codiert wind, wobei die Codierung im Leseraster mit den entspre- 
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chenden Aminosauresequertzen, die in den Rguren 7 bis 10 Oder 13 bis 18 dargestelft sind. ist; 
(b) eine antigene Detenminante umfaOt; und 
5 (c) sich von der Sequenz der von dem HCV-lsolat HCV-1 codierten Polypeptide unterscheidet. 

8. Polypeptid nach Anspruch 7, das mindestens 10 Aminosauren umla8t. 

9. Polypeptid nach Anspruch 7, das mindestens 15 Aminosdu-en umfaet. 

10 

10. Polypeptid nach einem der AnsprOche 7 bis 9, das auf einem festen Trflger immobilisiert ist 

11. Inrununoassay zum Nachweis der Anwesenheit von Anti-HCV-AntikOrpem in einer Testprobe, wobei man 

IS (a) die Testprobe unter Bedingungen inkubiert, die die Bildung eines zu bildenden Antigen-AntikOrper-Kbmple- 

xes mrt einem Polypeptid. wie in einem der AnsprOche 7 bis 10 definiert. ertauben, wobei das Polypeptid mit 
HCV-1 nicht immunologisch kreuzreaktiv ist; und 

(b) aile gebikJeten Antigen-AntikOrper-Kbmplexe nachweisL 

20 

12. Immunoassay nach Anspruch 1 1 . worin die Testprobe menschlfches Blut Oder eine Fraktion davon umfaSt. 

PatantansprOche fOr fdlgenden Vertragsstaat : ES 

25 1 . Verfahren zur Herstellung eines Polynudeotids in im wesentlichen isolierter Forni, das eine Nucleotidsequenz aus 
mindestens 15 Nudeotiden aus einem J-1 HCV-lsolal umfaBt, wobei das J-1 HCV-lsolat eine mindestens 90%ige 
Nucleotidsequenzhomologie mit der J-1-Sequenz einer der Rguren 7 bis 10 Oder 13 bis 18 besitzt. wobei sk:h die 
Nucleotidsequenz aus mindestens 15 Nudeotiden von der Nudeotidsequenz des HCV-lsolats HCV-1. unterschei- 
det. wobei das Verlahren umfaBt: 

30 

a) chemische Synthese nach an sich bekannten Verfahren; oder 

b) DNA-Replikatk)n nach an sich bekannten Verfahren; oder 

35 c) Transkription oder Umkehrtranskription nach an sich bekannten Verlahren; oder 

d) Restriktionsenzymspaltung und Ligierung in einem Vektor durch an sich bekannte rekombinante DNA-Tech- 
niken. 

40 2. Verfahren zur Herstellung eines Polypeptids. umfassend eine Aminosduresequenz. die von einer Nudeotidse- 
quenz. wie in Anspruch 1 definiert. codiert wind, wobei das Verfahren um^et: 

(i) chemische Synthese nach an sich bekannten Verfahren; oder 

45 (ii) Translation von der entsprechenden Nudeinsfluresequenz nach an sich bekannten Verfahren; oder 

(iii) Expression eines rekombinanten Expressionssystems, das die Nudeotidsequenz umfaBt, die das Polypep- 
tid codiert nach an sich bekannten Verfahren; oder 

50 (iv) Isdation aus einem Virus nach an sich bekannten VerMren. 

3. Verfahren zum Nachweise von HCV-Polynudeoliden in einer Testprobe. umfassend: 

(a) Bereitstellen eines Polynudeotids, wie in Anspruch 1 definiert, als Sonde, 

55 

(b) In-Kbntakt-Bringen der Testprobe und der Sonde unter Bedingungen, de die Bildung eines Polynudeotid- 
Doppdstranges zwischen der Sonde und seinem Komplementflrstrang in Abwesenheit einer wesentlichen Bil- 
dung von Polynudeotid-Doppetetrangen zwischen der Sonde und den in der Testprobe vorhandenden Nicht- 
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HCV-Potynudeotidsequenzen eriauben; und 

(c) Nachweis aller POlynucleotid-Doppelstrange, die die Sonde umfassen. 

4. Immunoassay zum Nachweis der Anwesenheit von Anti-HCV-AntlkOrpem in einer Testprobe, umfassend: 

(a) Inkubieren der Testprobe unter Bedingungen. die die Bildung eines zu bildenden Antigen-AntikOrper-Kbm- 
plexes mit einem Polypeplid eriauben, das von einer Nudeotidsequenz, wie in Anspruch 1 d^inlert, Oder einer 
Sequenz aus mindestens 15 Nudeotiden aus einem J-1 HCV-lsolat, das in einem der Plasmide pS1-8791a, 
bUM216c, bU1-4652d. pS1'713c pSI-1519, TC-600BP. JH-400BP. AW-300BP, AW-770-BP-N Oder J1 5-M, 
die unter den Hinteriegungsnummem BP-2593. BP-2594. BP-2595, BP-2637, BP-3081, 68393, 68394. 68392. 
68395 bzw. 40884 hinteriegt wurden, voriianden ist codiert wird. wobei sich die Nudeotidsequenz aus minde^ 
stens 15 Nudeotiden von der Nudeotidsequenz des HCV-lso-lats HCV-1 unterscheidet, wobei die Codierung 
im Leseraster mit den entspredienden Aminosfluresequenzen der Figuren 7 bis 10 Oder 13 bis 18 ist. wobei 
das Polypeptid eine antigene Determinante umfaBt und sich von der Sequenz des von dem HCV-lsolat HCV- 
1 codierten Pdypeptids unterscheidet und mit HCV-1 nicht immundogisch kreuzreaktiv ist; und 

(b) Nachweis aJler gebiMeten Antigen-AntikOrper-Komplaxa 

5. Immunoassay nach Anspruch 6. worin die Testprobe menschliches Blut Oder eine Fraktion davon umfaSL 

6. Pdynudeotid in im wesentlichen isdierter Form, umfassend eine Nudeotidsequenz von mindestens 15 Nudeoti- 
den aus einem J-1 HCV-!solat wobei das J-1 HCV-lsolat mindestens 90% Nudeotidsequenz-Homologie mrt der J- 
1 -Sequenz einer der Figuren 7 bis 10 Oder 13 bis 18 besitzt wobei skrfi cfie Nudeotidsequenz von mindestens 15 
Nudeotiden vender Nudeotidsequenz des HCV-lsolats HCV-1 unterscheidet 

7. Polynudeotid nach Anspruch 6. wobei das J-1 HCV-lsolat eine mindestens 95%ige Homologie mit der J-1 HCV- 
Sequenz einer der Rguren 7 bis 10 Oder 13 bis 18 besitzt. 

8. Polynudeotid nach Anspruch 6, wobei das J-1 HCV-lsdat eine 100%ige Homdogie mit der J-1 HCV-Sequenz 
einer der Rguren 7 bis 10 Oder 13 bis 18 besitzt 

9. PolynudeotkJ nach einem der AnsprOche 6 bis 8, das mindestens 20 NudeotWe umfaSt. 

10. PolynudeotkJ, umfassend eine Sequenz aus mindestens 15 Nudeotiden aus einem J-1 HCV-lsolat das in einem 
der Plasmide pSl-8791a. bUM216c, bU1-4652d, pS1-713c. pSI-1519, TC-600BP. JH-400BP. AW-300BP, AW- 
770-BP-Noder J1 5-1-1. hinteriegt unter den Hinteriegungsnummem BP-2593, BP-2594, BP-2595. BP-2637, BP- 
3081, 68393. 68394, 68392, 68395 bzw. 40884. vortianden ist, wobei sich die Nudeotidsequenz aus mindestens 
15 Nudeotiden von der Nudeotidsequenz des HCV-lso!ats HCV-1 unterscheidet 

11. Gereinigtes Polypeptid, umfassend eine Aminosduresequenz, die 

(a) von einer Nudeotidsequenz. wie in einem der AnsprOche 6 bis 9 definiert Oder den HCV-Sequenzen, die 
hinteriegt wurden und in Anspruch 10 definiert sind. codiert wird. wobei die Codierung im Leseraster mit den 
entsprechenden Aminosfturesequenzen, die in den Rguren 7 bis 10 Oder 13 bis 18 dargestelK sind, ertblgt; 

(b) eine antigene Determinante umfaOt; und 

(c) sfch von der Sequenz des von dem HCV-lsolat HCV-1 codierten Polypeptids unterscheidet. 

1 2. Polypeptid nach Anspruch 1 1 , das mindestens 1 0 Aminosduren umfaBt. 

13. Polypeptid nach Anspruch 1 1, das mindestens 15 Aminosfluren umfe8t. 

14. Polypeptid nach einem der AnsprOche 1 1 bis 13. das auf anem festen Trflger immobilisiert ist. 
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Revendlcatlons 

Revendlcatlons pour les Etats contractants sulvants : AT, BE, CH, U, DE, DK, FR, GB, GR, IT, LU, NL, SE 

1. Potynudtotide sous une forme sensiUement Isolde, comprenant une sequence nud^otidique d'au moins 15 
niJCI6otides d'un isolat HCV J-1 , ledit isolat HCV J-1 ayant une homologie de s^ence nucl6otidique d'au moins 
90 % avec la s^ence J-1 de Tune quelconque des figures 7 ^ 10 ou 13 ^ 18. dans lequel ladite s^uence nucl^ 
tidique d*au moins 15 nucleotides est diff^rente de la sequence nud^otidique de llsolat HCV-1 de HCV. telle que 
representee k la figure 12. 

2. Polynudeotide selon la revendication 1 dans lequel Hsolat HCV J-1 a une honrxslogie d'au moins 95 % avec la 
sequence de HCV J-1 de Tune quelconque des figures 7 ^ 1 0 ou 1 3 ^ 1 8. 

3. Potynudeotide selon la revendication 1 dans lequel Tisolat HCV J-1 a une homologie de 100 % avec la sequence 
de HCV J-1 de Tune quelconque des figures 7 ^ 10 ou 13 ^ 18. 

4. Pdynudeotide selon Tune quelconque des revendications precedentes qui comprend au moins 20 nud6otides. 

5. Methode de detection de polynucleotides de HCV dans un echant'llon de test conprenant : 

(a) la fburnilure d'un polynud6otide tel que defini dans I'une quelconque des revendications 1 h 4 comme 
sonde ; 

(b) la mise en contact de rechantillon de test et de la sonde dans des conditions qui permettent la formation 
d'un duplex pdynudeotidique entre la sonde et son complement en I'absence de fonnation appreciable de 
duplex polynudeotidiquo entre la sonde et des sequences polynud6otidiques autres que de HCV presentes 
dans rechantillon de test ; et 

(c) la detection de tous duplex polynudeotidiques comprenant la sonde. 

6- Polynudeotide comprenant une sequence d'au moins 15 nudeotides d'un isdat HCV J-1 present dans Pun quel- 
conque des plasmides pS1-8791a. bU1-1216c. bu1-4652d. pS1-713c. pSM519. TC-600BP. JH-400BP, AW- 
300BP. AW 770-BP-N ou J1 5-1-1 deposes sous les numeros d'acces BP-2593. BP-2594. BP-2595, BP-2637, BP- 
3081. 68393. 68394. 68392, 68395 et 40884 respectivement. ladite sequence nudeotidiqued'au moins 15 nucleo- 
tides etant differente de la sequence nudeotidique de Hsolat HCV-1 de HCV. 

7. Polypeptide purilte conprenant une sequence d'acides amines qui : 

(a) est codee par une sequence nudeotidique telle que deflnie dans I'une quelconque des revendications 1 d 
4 ou dans les sequences de HCV deposees et def inies k la revendication 6. ladite sequence codante etant en 
phase avec les sequences d'acides amines oon-espondantes presentees dans les figures 7 d 1 0 ou 1 3 A 18 ; 

(b) comprend un determinant antigenique ; et 

(c) est distincte de la sequence des polypeptides codes par Tisolat HCV-1 de HCV 

8. Polypeptide selon la revendication 7 qui conprend au nioins 1 0 acides amines. 

9. Polypeptide sefon la revendication 7 qui comprend au moins 1 5 acides amines. 

10. Polypeptide sefon I'une quelconque des revendications 7 69 immobilise sur un support solide. 

1 1 . Test immundogique destine k detector la presence d'anticorps anti-HCV dans un echantilton de test qui conprend 



(a) rincubation de rechantillon de test dans des conditions qui penmettent la formation d'un conplexe antigene- 
anticorps k former avec un polypeptide tel que defini dans I'une quelconque des revendications 7 6 10. le pdy- 
peptide ne produisant pas de reaction immunologique croisee avec HCV-1 ; et 

(b) la detection de tout conplexe antigene-anticorps forme. 

12. Test immunologique selon ia revendication 1 1 dans lequel rechantillon de test comprend du sang humain ou une 
fraction de celui-d. 
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Revendlcatlons pour i'Etat contractant sulvant : ES 

1. Mdthode de preparation d*un polynud6olide sous une forme sensiWement isol6e qui conprend une s^uence 
nucl6olidique d-au moins 15 nuclides tfun isolat HCV J-1, ledit isolat HCV J-l ayant une homologie de 
sequence nudtotidique rfau moins 90 % avec la s^uence J-1 de rune quelconque des figures 7 a 10 ou 13 ^ 18. 
ladite sequence nud^otidique d'au moins 15 nud6otides ^ant diff6rente de la sequence nudMidique de I'isdat 
HCV-1 de HCV, ladite mdthode comprenanl :. 

a) la synthase chimique par des mdthodes connues per se ; ou 
b} la replication d*AON par des mdthodes connues per se ; ou 

c) la transcription ou rdtrotranscription par des mdthodes connues perse ; ou 

d) la digestion par des enzymes de restriction et la ligature dans un vecteur par des techniques de I'ADN 
recombinant connues perse. 

2. Methods de preparation tfun polypeptide comprenant une sequence d'acides amines qui est codee par une 
sequence nudeotldique telle que definie k la ro^endication 1. ladite methode conprenant : 

0) la synthese chimique par des methodes connues per se ; ou 

(ii) la traduction d partir de la sequence d'adde nudeique conrespondante par des methodes connues perse 
;ou 

(iii) Texpression d'un systeme d'expression recombinant comprenant la sequence nudeotidique cedent pour te 
polypeptide par des methodes connues perse ; ou 

fiv) ftsolement k partir de virus par des methodes connues per se. 

3. Methods de detection de polynudeotides de HCV dans un echantillon de test conprenant : 

(a) la fourniture d'un polynudeotide tel que defini k la revendications 1 comma sonde ; 

(b) la mise en contact de rechantillon de test et de la sonde dans des conditions qui permettent la formation 
d'un duplex polynudeotidique entre la sonde et son complement en I'absence de formation appreciable de 
duplex polynudeotidique entre fa sonde et des sequences poiynudeotidiques autres que de HCV presentee 
dans rechantillon de test ; et 

(c) la detection de tous duplex polynud6olidiques comprenant la sonde, 

4. Test immunotogique destine k detecter la presence d'anticorps anti-HCV dans un echantillon de test qui corrprend 



(a) rincii)ation de rechantiiton de test dans des conditfons qui permettent la formation d*un conplexe antigene- 
anticorps k former avec, un polypeptide qui est code par une sequence nudeoticfique telle que definie k ia 
revendication 1 ou une sequence d^au moins 1 5 nudeotides d'un isolat HCV J-1 present dans run quelconque 
des plasmides pSl-8791a, bU1-1216c bU1.4652d, pS1-713c, pSI-1519. TC-600BP. JH-400BP, AW-300BP. 
AW 770-BP-N ou J1 5-1-1 deposes sous les numeros d'acces BP-2593. BP-2594, BP-2595, BP-2637. BP- 
3081, 68393. 68394, 68392. 68395 et 40884 respectivement. ladrte sequence nudeotidique d'au moins 15 
nudeotides eiant differente de la sequence nudeotidique de llsolat HCV-1 de HCV. ladite sequence codante 
etanten phase avec les sequences d'acides amines con-espondantes representees dans les figures 7 e 10 ou 
1 3 A 1 8, lequel polypeptide conprend un determinant antigenique et est distinct de la sequence du polypeptide 
code par llsolat HCV-1 de HCV et ne produit pas de reaction immunotogique croisee avec HCV-1 ; et 

(b) la detectfon de tous complexes antigene-anticorps formes. 

5. Test immunologique selon la revendication 6 dans lequel rechantiiton de test comprend du sang humain ou une 
fraction de celui-ci. 

6. Polynudeotide sous une forme sonsWement isoiee conprenant une sequence nudeoticfique d'au moins 15 
nucleotides d'un isolat HCV J-1. ledit isolat HCV J-1 ayant une homologie de sequence nudeotidique d*au moins 
90 % avec la sequence J-1 de I'une quelconque des figures 7 e 10 ou 1 3 6 18. dans lequel ladite sequence nudeo- 
tidique d*au moins 15 nudeotides est differente de la sequence nudeotidique de Ttsolat HCV-1 de HCV. 

7. Polynudeotide selon la revendication 6 dans lequel llsolat HCV J-1 a une homologie d'au moins 95 % avec la 
sequence de HCV J-1 de Tune quelconque des figures 7 ^ 10 ou 13 6 18. 
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8. Polynudtotide selon la revendication 6 dans lequel Hsolat HCV J-1 a une homologie de 1 00 % avec la s^uence 
de HCV J-1 de Tune quelconque des figures 7 ^ 10 ou 13 & 1 8. 

9. Polynucltetide selon i'une quelconque des revencfications 6 d 8 qui conprend au moins 20 nucfdotides. 

s 

10. Polynucleotide comprenant une sequence d*au moins 15 nucleotides d'un isolat HCV J-1 present dans Tun quel- 
conque des plasmides pS1-8791a. bU1-1216c. bU1-4652d. pS1-713a pSM519. TC-600BR JH-400BP. AW- 
300BP, AW 770-BP-N ou J1 5-1 -1 d^pos^s sous les num6ros d*acc6s BP-2593. BP-2594. BP-2595, BP-2637. BP- 
3081. 68393. 68394. 68392. 68395 6t 40884 respectivement. ladite sequence nuddotidiqued'au moins 15 nucl6o- 

10 tides etant cfiff6rente de la sequence nucidotidique de I'isolat HCV-1 de HCV. 

11. Polypeptide purifie comprenant une sequence d*acides amines qui : 

(a) est cod6e par une sequence nud^otidique telle que d^finie dans I'une quelconque des revendications 6 k 
IS 9 ou dans les sequences de HCV depos6es et d^finies k la revendication 10. ladite sequence codante etant 

en phase avec les sequences d*acides amines correspondantes presentees dans les figures 7 ^ 1 0 ou 1 3 ^ 1 8 

(b) comprend un determinant antigenique ; et 

(c) est (fistincte de la sequence du polypeptide code par ilsdat HCV-1 de HCV 

20 

12. Polypeptide selon la revendication 1 1 qui comprend au moins 10 acides amines. 

13. Polypeptide selon la revendication 1 1 qui comprend au moins 15 acides amines. 

25 14. Polypeptide selon I'une quelconque des revendications 11613 immobilise sur un support solida 
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JT 1 AGCCGAGTAGTCTrGGGTCGCGAAAaCCTTCTOiTACrG^ 60 

discrepancy 

clone 

altered aa 

ietSerThrAsnProLysProGlnArjLys 
JT SI TGCCCCGGGAGGTCTCGTAGACCGTGCATCATGAGCAC.\AATCCTAAACCCCAA.\GAAA.A 120 

TG 
bl 
— Arff 

ThrLysArfAsnThrAsnArcArffProGlnAspValLysPhcProGlyGlyGlyGlnlle 
JT 121 ACCAAACGTAACACCAACCGTCGCCCACAGGACGTTAAGTTCCCGGGCGGTGGTCAGATC 180 

. c r 

Leu 

ValGlyGlyValTyrLeuLeuProArsArgGlyProArgLcuGlyValArgrAlaThrArff 
JT 181 GTCGGTGGAGTTTACTTGTTGCCGCGaGGGGCCCCAGGTTGGGTGTGCCTGCGACTAGG 240 
T A 
b b 



LysThrSerGluArgSerGlnProArgGlyArfArjGlnProIleProLysAlaArjArff 
J i 241 AAGACTTCCGAGCGGTCGCAACCTCGTGGAAGCCGACAACCTATCCCCAAGGCTCGCCGG 300 

A 
b 



ProGluGlyArgrrhrTrpAlaGlnProGIyTyrProTrpProLeuTyrGlyAsnGluGly 
J7 301 CCCGAGGGaGGACCTGCCCTCAGCCTOWTATCCrTGGCCCCTCT^ 360 



- LeuGlyTrpAlaGlyTrpLcuLeuScrProArfGlySerArfProSerTrpGlyProAsn 
J" ™»XrrGGGCAGGATGGCTCCTGTCACCCCGCGGCTCrC^ 420 

A T C 

^ c b 

END ._-nir 

AspProArfArcArfSerArcAsnLeuGlyUysVallleAspThrLcuThrCysGlyPhe 
J7 421 GACCCCCCGCGTAGCTCGCGTAATTTGGGTAAGGTCATCGATACCmACATGCCGCnC 480 

C 
1 

Leu 

FIGURE l-l 
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AlaAspLeuMetGIyTyrlleProLeuValGlyAlaProLeuGIyOIyAlaAlaArffAia 
J7 481 GCCGACCTCATGGCGTACATTCCGCrrGTCGGCGCCCCCTTAGGGWCGCTGCCACGGCC 540 

c c 

c b 



LeuAlaHlsGIy 
J7 541 CTGGCACATGGT 552 



FIGURE 1-2 
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ProLeuVaiGlyAlaProLeuGlyGlyAlaAlaAr^AlaLeuAlaHisGlyVaUrjVal 
J 1 I TCCGCTCGTCGGCGCCCCCTTAGGGGGCGCTGCCAGGGCCCTGGCACATGGTGTCCGGGTT 6 1 
discrepancy c 
clone d 
altered aa Ser 

LeuGluAspGlyValAsnTyrAlaThrGlyAsnLeuProGlyCysSerPheSerllePhe 
62 CTGGAGGACGGCGTGAACTATGCAACAGGGAAmGCCCGGTTGCTCTTTCTCTATCTTC 121 



LeuLeuAlaLeuLeuSerCysLeuThrlleProAlaSerAlaTyrGluValArrAsnVal 
122 CTCTTGGCTCTGCTGTCCTGTTTGACCATCCCAGCTTCCGCTTATGAAGTGCGCAACGTG 181 
A T 
€ d 



SerGlylleTyrHisValThrAsnAspCysSerAsnSerSerlleValTyrGluAlaAla 
182 TCCGGGATATACCATGTCACAAACGACTGCTCCAACTCAAGCATTGTGTATGAGGCGGCG 241 
T 
d 



AspVallleMetHlsAlaProGlyCysValProCysValArjGluAsnAsnSerSerAry 
242 GACGTGATCATGCATGCCCCCGGGTGCGTXK:CCTGCGTTCGGGAGAAC AATTCCTCCCCT 30 1 

C 
d 



CysTrpValAlaLeuThrProThrLeuAlaAlaArtAsnAlaScrValProThrThrThr 
302 TGCTGGGTAGCGCTCACTCCCACGCTCGCGCCaGGAATCCCAGCCTCCCCACTAC^ 361 



LcuArtArfHlsValAspLeuLeuValGlyThrAlaAlaPheCysSerAlaMecTyrVal 
3S2 mCGACGCCACCTCCACTrGCTCGTTGGGACGGCTGCTrrCT^ 421 
G 
d 



FIGURE 2-1 
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GlyAspLeuCysGlySerValPheLeuIleSerGlnLeuPheThrPheSerProAr«rArff 
422 GGGGATCTCTGCGGATaCTTTTCCTCATCTCCCAGCTGTTCACCTTCTCGCCTCGCCGG 481 

T 
d 



HIsGluThrValGlnAspCysAsnCysSerlleTyrProGlyHisValSerGlyHisArc 
482 CATGAGACAGTACAGGACTGCAACTGCTCAATCTATCCCGGCCACGTATCAGGCCATCGC 341 

T 



MetAlaTrpAspHetMetMetAsnTrpSerProThrAla 
542 ATGGCTTGGGATATGATGATGAACTGGTCGCCCACGGCA 580 



FIGURE 2-2 
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AsnTrpSerProThrAla 
Jl I AACTGGTCGCCCACGGCA 18 

discrepancy : : : 

clone 

altered aa 

AiaLeuValValSerGlnLeuLeuArtrlleProGlnAlaVaLMetAspNetValAlaGly 
Jl 19 GCCTTAGTGGTGTCCCAGTTAaCCGGATCCCACAAGCTGTCATGGACATGCTGGCGGGG 78 



AlaHisTrpGlyValLeuAlaGlyLeuAlaTyrTyrSerMetValGlyAsnTrpAlaLys 
Jl 79 GCCCACTGGCGAGTCaAGCCGGCCTTGCCTACTATTCCATGGTGGGGAACTGGGCTAAG 138 

G A 
i I 



V alLe ulleVaLMetLeuLeuPheAlaGlyValAspGlyHlsThrArgValThrGlyGly 
Jl 139 GTmGAmTGATGaAaCTTTGCCGGCGTTGACGGGCATACCCGCGTGACGGGGGGC 198 

AG A 

» i 
Ser 

ValGlnGlyHlsValThrSerThrLeuThrSerLeuPheArffProGlyAiaSerGlnLys 
J I 199 GTGCAAGGCCACGTCACCTCTACACTaCGTCCCTCTTTAGACCTGGGGCCTCCCAGAAA 258 
T G 
c 1 

Ala 

IleGlnLeuValAsnThrAsnGlySerTrpHlsIleAsnArsThrAlaLeuAsnCysAsn 
Jl 259 AnaCCTTGTAAAaCCAATGGCAGTTGGCATATCAACAGGACT^ 318 
TC T T 

11 1 t 
SerLeu 

" • • • • 

AspSerLeuGlnThrGlyPheLeuAlaAlaLeu 
Jl 319 GACTCCCTCCAAACTGGCTTCCTTGCCCCCCTC 351 



FIGURE 3 
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Ser 

Jl I CTCA 4 

discrepancy 

clon« 

altered aa 

VallleAspCysAsnThrCysValThrOlnThrVaiAspPheSerLeuAapProThrPht 
Jl 5 CTGATCGACmn'AACACATtmTrCACmCUCGGTCGATTTCACK^^ 64 

G 
e 

Ala 

• • • • • • 

ThrllcGluThrThrThrValProGlnAapAlaValSerArgThrGlnArcArfGlyArc 
Jl 65 ACCATCGAGACGACGACCGTGCCCCAAGATGC(KimCGCGCACGaGCGGCGAGGTAG^ 124 



ThrGlyAriGlyArsArsGlylleTyrArffPheValTbrProGlyGluArfProSerAla 
Jl 125 ACTGGCAGGGGCAGGAGAGGCATCTATAGGTTTGItUCTCCAGGAGAACGGC^ 184 



MetPhcAspSerSerValUuCyaGluCytTyrAspAlaClyCysAlaTrpTyrGluUtt 
Jl 185 ATGTTCGATTCrraOTCCTATGTGAGTGTrATGACCCCGGC^^ 244 

A 

GlJ(.) 

TteProAlaOliinirSerValArfUuArsAlaTyrUuAsnThrFroGlyUi^ 
Jl 24S ACGCCOGCTtSAGACCTOGGTTAGGnGCGGGOTACCTAAATAaCCAGGGT^^ 304 



CytClnAapHlaUuGluPheTrpciuSerValPheTlirGlyUuTlirHiflleAapAU • 
Jl 30S TGCaGGACCATCTOUGTrCTGGCUGAGCGTCT^ 364 



HisPheUttSerGlaThrl^GliiAiaGlyAspAaiiFheProiyrUuValAla'trrGlA 
Jl 36S aCTTCTI G T q CCACACTAAGCAGCCAGGAGACAACTrCCCCTACCTGGTAGaTACCU 424 

FIGURE 4-1 
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4W GCaCAGTGTCCGCCAcScTAAGGCTSSccSc^^ 

C 
e 

485 CTCATAOKHrrAAAaXrACCCTCaCGtWXMCGCCCCia^ 

A 

e e 
Al* Arc(-) 

ValOlnAanGluV«iThrLeuThrHi«ProIleThtLyt 
545 GTCCAGAATGAGGTCACCCraaaCCCTATAACCAAA 583 



FIGURE 4-2 
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LeuThr 

JL I CCTCACC 7 

discrepancy 

clone 

altered aa 

ArgAspProThrValProLeuAlaArrtlaAlaTrpGluThrAlaArjHisThrProVai 
8 CGTGACCCCACCGTCCCCCTTtKXSCGGGCTGCGTGGGAGACAGCTAGACACACTCCAGTC 67 

C 

Thr(-) 

AsnSerTrpLeuGlyAsnllelleMetTyrAlaPraThrLeuTrpAlaAriMetlleLeu 
68 AACTCCTGGCTAGGCAACATCATCATGTATGCGCCCACriTGTGGGCAAGGATGATTCTG 127 

T 

Ile(.> 

nctThrHlsPhePheSerlleLcuLeuAlaGlnGluGlnLcuGluLysAiaLeuAspCys 
128 ATGACTCACTTOTCTCCATCOTCTAGCCCACGAGCAAmGAAAAAGCCCTGGATTGT 187 



GlnlleTyrGlyAlaCysTyrSerlleGluProLeuAspLeuProGlnlielleGluArff 
188 CAAATCTACGGGGCCTGmCTCCATTGAGCCACTTGACCTACCTCAGATCATTGAACGA 247 



LeuHlsGlyLeuSerAlaPheSerLeuHlsScrTyrSerProGlyGlulieAsnAr^al 
248 CTCCATGGTCnAGCGCATTTTCACTCCATAGmCTCTCCAGGTCAGATCAATAGO 307 



AlaSerCysLeuArgLysLeuGlyValProProLcuArfValTrpArcHlsArrAlaArf 
308 GCrrCATGCCTCAGGAAGCrnXXOTACaCCCnGaAGTC^ 367 



SerValArcAlatysLcuLeuSerGlnGlyGlyArgAlaAlaThrLysGlyLysTyrLeu 
368 AGTGTCOiCGCTAAGCTACTGTCCCAAGGCGGGAGGGCXGCCACTrc 427 

G 
C 

Gin(>) 



FIGURE 5 
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J7 1 AGCCCAGTAGTGTTCGGTCGCGAAAGCCCnTGTGGTACTGCCrGATA(KWTCOT 60 

HCVl 



MetSerThrAsnProLysProGlnArffLys 
61 TGCCCCGGGACGTaCGTAGACCCTCCATCATGAGaCAAATCCTAAACCCCAAAGAAAA 120 
HCVl C G T A 

Lys 

ThrLysArcAsnThrAanAriArcProGlnAspValLysPheProGlyGlyGlyClnlle 
J7 121 ACCAAACGTAACACCAACCGTCGCCaaGGACGTTAAGTTCCCGGGCGGTCGTCAGATC 180 
HCVl A C T C 

Asa 

I- yfli^iy^^yV^iT'yrLeuLeuProArtXrgClyPraArjLcuGlyValArgAlaT 
J . 181 GTCGGTCGAGmAmGTTGCCGCGCACGGCCCCCAGGTTGGGTGTGCCTCCCAaAGG 240 
HCVl T T A . C G A 



^y«T^^fSerGluArjSerGlnProAriGlyArfArgGlnProIleProLysAlaAr«Arr 
J7 241 AAGACTTCCGAGCGGTCGCAACCTCCTCGAAGGCCACAACCTATCCCCAAGGCTCGCCGG 300 
HCVl A T A T G T 



,^ P^^luGlyArgThrTrpAlaCInProGlyTyrProTrpProLeuTyrGlyAsnGluGly 

J7 301 CCCGAGGGaGGACCTGGGCTCACCCTGGGTATCCTTGGCCCCTCTATGCCAATGACGGC 360 

HCVl C C 



n-. • if^^^P^«Gl/rrpUuLeuSer Pr oATfGIySerAr fProScrTrpGly^ 

J7 361 TTCGGCTCGGaGGATGGCTCCTtmUCCCCGCGCGT^^ 420 
HCVl GC C T T C CA 

- ^ Thr 

T-T i?KE^^*^^^^*^*'^^^^*'^***^^^»V*^^l«AspThrLeuThrCysGlyP 

J7 421 GACCCCCGGCCTACGTCGCGTAATTTGCGTAAGGTCATCGATACCCTTACATGCGGOT 480 

HCVl C c 



FIGURE 6-1 
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AlaAipLeuMetClyTyrlleProLeuValGlyAlaProLauGlyGlyAlaAlaArcAla 
„J1 GC«»ACCTCATG«KrrACAncCGCTr(rrcaKXCCCCCTrAGGGa5COT^ 340 

HCVl AC TC T A 



LeuAlafllsGLy 
J7 S41 CTGCCACATG6T 532 
HCVl G C 781 



FIGURE 6-2 
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„ , ,J"'f"!fi<'lyAlaProLeuGlyGlyAlaAiaArfAlaLeuAlaHlsGlyValAr«Val 

J I 1 TCC(K:rcGTcaK:GcccccnAaxxxK:GCTGccAax;cccTaK:ACATaiTGTccGo^ si 

"CVl A TC T A G C 



„ ^$}"*sPGlyValAsnTyrAlaThrGlyAsnUuProGlyCysSerPheSerIlePhe 
Jl 62 CTGGAGCACGCCCTGAACTATGCAAaCCGAATrTGCCCGGTOCTCTTTCTCTATCTTC 121 
nCVl A CC T T 



J, ,„ ^i^"AlfJf"LeuSerCys^ 

urir, '^SP'^CTWrrGTCCTGmGACaTCCCAGCTOCGmATGAAGTGCGCAACGTG 181 

HCVl TC C C T C TG G C G C CC TCC 

Val Gin Ser 

••• 

T, ,00 ESESiy'^«"''y^"isValThrAsi»AspCysSerAsnSerSerIleValTyrGluAlaAia 
uri\ T^^^^^'iTATACCATCTCACAAACGACTGCTCCAACTCAAGCAnGTGTATGACGC^ 241 

HCVl i,<5CT C CTTCT GT C C 

Tar Leu p^o 

urih '^*$CT5*TCATGCATCCCCCCGGGTGCGTCCCCTCCGTTCGGCAGAACAATrCCTCCCGT 301 

HCVl TCCC CATG CT TGGCG GAG 

Ala Leu Thr oiy Ala 

T1 nno S^!l!?!;^*if"ThrProThrUuAiaA^ 

Jl 302 T(OT(XKn-A<X»:TaCTCCCACGCTCCC(XWa(^^ 361 
"C^l T GAGCTGGCA GCAAC GGUG 

««t Yal Thr AapGlyLysLeu Ala Gin 

• • • , , 

T, i*'?*5f*5f5*'**"*P''*"'-*"^*l<'l3^fAi«Ai*J^«<:y»SerAla«^ 
t.^,;, - Fi"**^'^*^°™^*™<^<*"CCCACGCCTGCTTTCT^ 421 
HCVl .-CT TA TC TC CCCACC TGCCC 

Ser TbrLeu Leu 

11 ^,-1 5iJi'ilifli5[*<Sl3^«''ValPheLeulieSerGlnLeuPheThrPheSerProAr«Arr 

1'- • »- J »»CG<UTCT{aTnCCTaTCTCCCAGCrGTn:ACCTTCTCCCCTCCCCG6 481 
HCVl CA 6 CTT0S6A TCAGC 

ValGly 

FIGURE 7-1 
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HlsGluThrValGlnAspCysAsnCysSerlleTyrProGlyHisValSerGlyHlsArr 
Jl 482 aTGAGACAGTACAGGACTGCAACTGCTCAATCTATCCCCGCaCCTATCAGGCCATCGC 341 
HCVl CTG GACG ACT T T TAAGTC 

Trp Thr Gly iieThr 






MetAlaTrpAspMetfletMetAsnTrpSerProThrAla 
Jl 342 ATGGCTTGCGATATGATGATGAACTGGTCCCCCACGCCA 380 
HCVI A C T A G 

Thr 



FIGURE 7-2 
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\snTrpSerProThrAla 
.11 1 \ACTGGTCGCCCACGGCA 18 

HCVl. c T AG 

Thr 

AlaLeuValValSerGlnLeuLeuArflleProGlnAiaValWctAspMetValAlaGly 
Jl 19 GCCTTAGTGGTGTCGCAGTTACTCCGGATCCCACAACCTGTCATGGACATGGTGGCCGGG T3 
HCVI . GGUGTCG at ACTT 

.necAla IleLeu He 

AlaHlsTrpGlyValLeuAlaGlyLeuAlaTyrTyrSerletValGlyAsnTrpAlaLys 
J I 79 GCCCACTGGGGAGTCCTACCGGGCCTTGCCTACTATTCCATGGTGGGGAACTGGGCTAAG 138 
HCVl T G A A .G T TC G 

He Phe 

ValLeuIleValMetLeuLeuPheAlaClyValAspClyHisThrArffValThrGlyGly 
Jl 139 GTmGAmTGATGCTACrCTTTGCCGGCGTTCACGGGCATACCCGCGTGACGGGGCGG 198 
HCVl CC G A C G A C C C A A C C A 

Val Leu AlaGIu His 

ValClnGlyHisValThrSerThrLeuThrSerLeuPheAriProGlyAlaSerGlnLys 
Jl 199 GTaAAGGCCACGTCACCTCTACACTCACGTCCCrCTTTAGACCTGGGCCGTCCCAGAAA 258 
HCVl AGTGCC ACTGTC GG T TGTTAG C CGC AC CAAG C 

SerAla ThrVal GlyPheVal LeuAla Lys Asn 

•••••• ««« 

IleGln LeuVa lAsnThrAanGlySerTrpHisIlcAanArgThrAlaLeuAsnCysAsn 
Jl 259 ATrCAGCTTGTAAACACCAATGGCAGTrCGCATATCAACAGGACTGCCCTGAACTGCAAT 318 
HCVl GC GAG C CCTCG 

Val He Leu Ser 

• • • • 

AspSerLeuGInThrGIyPheLeuAlaAlaLeu 
Jl 319 GACTCCCrCCAAACrGGGrrrCCTTGCCGCGCrG 351 
HCVl TAG ACCCGGTGAG T 

Aan Trp Gly 



FIGURE 8 
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Jl 1 4JiJ)f*"i;A«pCy»AsnThrCysValThrClnThrVai 

ECVl ®^"^'=**<^CATCCCACCTrCACCATCGACACOACCACCCTGCCCCAAGATCCCGTT 

T ATCGCC 0 tC 

21m. 

^ TATOC CACC CA 

LyaPro 

«i ° %g e 0 c c c 

* CTAACAAQ 

Thr 

**** Gly • 

Jl 341 ACAoocCTci^e?i?S:«i?^iSi2i2ii;i^ 

HCVl *'^*'*'"""^*^$'^£*^A0A£OCCCACTrCTTGTCCCA0ACTAAGCAG6CAGGAGACAAC 

^ * ■ TC A A ACT G 8 

Sar Clu 

Jl 401 i5^J?2sii2Xii^*I^«"«laAlaTiiivaiC7«AlaAr^^ 

Jl 401 TTWCCTACCTOOTAOCATACCAACCCACAOTOTOCGCCAOGCCTM^ 

* S,^^ ' ° 

Pitt 

JX 481 ''SES^Sg^m^g^y '^^'t^'^ ^^^ArcLeuLyaPreThrLauHiaOlyPyoThr 
HCVl ^'='^'*^J<J*AAm«GM«roiCTCATACQ«TAMGCCTACOCTaCAC(!GGCf^^ 

TOTCC CCCT A 

ACAOCTT AA 00 AGC 



Jl 
HCVl 



lU Vml 

Jl 5#1 AAA 
HCVt tae*re»tcac«trc*trte 

FIGURE 9 
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LeuThr 
Jl I CCTCACC T 
HCVl 

.KrcAspProThrValProLeuAlaArsAlaAiaTrpGIurhrAiaArsHlsTtirProVai 
Jl 8 CGTGACCCCACCCTCCCCCnGCGCCK^GaGCGTGGGAGACAGCTAGAaaCTCCAGTC 67 
HCVl . T AAC C A A A 

Thr 



AsnSerTrpLeuGlyAsnllelleMetTyrAlaProThrLeuTrpAlaArsMecIleLeu 

iTATG 
T 

Phe 



Jl 68 AACTCCTGGCTAGGCAACATCATaTGTATGCGCCCA Cin 'r C ' l GGGCAAGGATGATTCTG 127 
HCVl T A T C AC G A 



MetThrHisPhePheSerlleLeuLeuAlaGlnGluGlnLeuGluLysAlaLeuAspCys 
Jl L28 ATGACTaCTrCTTCTCCATCCTTCTAGCCaGGAGCAACTTGAAAAAGCCCTGGATTGT 187 
HCVl C T TAG G A AG C G C G C C 

Val He AfcAsp Gin 



GInlleTyrGlyAlaCysTyrScrlleGluProLeuAspLeuProGlnllelleGluArc 
Jl 188 CAAATmCCGGCCCTGTTAaCCATTGAGCCACTTGACCTACCTCAGATCATTGAACCA 247 
HCVl GG C AA T CACA 

Glu Pro Gin 



LeuHlsGlyLeuSerAlaPheSerLeuHlsSerTyrSerProGlyGluIleAsnArcVal 
Jl 248 CTCCATCGTCnAGCCCATnTCACTCCATAGTTACTCTCCAGGTGAGATCAATAGGGTG 307 
HCVl C C C AT 

AlaSerCysLeuArrLysLeuGlyValProProLeuArffValTrpArcfllsArsAlaArc 
Jl 308 GCTTCATGCCrCAGGAAGCTTGCGGTACCACCCTTCCCAGTCTGGAGACATCGGGCCAGA 367 
HCVl CG AA G CT C'CG" 

Ala Ala 

- * SarValArsAlaLysLeuLeuSerGlnGlyOlyArcAlaAlaThrLysGlyLysTyrLau 
Jl 368 AGTCTCCGCGCTAAGCTACTGTCCCAAGGGGGGAGGGCCGCCACTTGTGGCAAGTACCTC 427 
HCVl C GTGAGACTTA 

Arc AlaArr Il« 
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- 2 6 7 GCGTCTAGCCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCCTCCCGGGA 
CGCAGATCGGTACCGCAATCATACTCACAGCACGTCGGAGGTCCTGGGGGGGAGGGCCCT 

-207 GAGCCATAGTGGTCTGCGGAACCGGTGAGTACACCGGAATTGCCAGGACGACCGGGTCCT 
CTOXSTATCACCAGACGCCTTCGCCACTCATGTGGCCTTAACGGTCCTGCT^ 

-147 TTCTTGGATCAACCCGCTCAATGCCTGGAG ATTTGGGCGTGCCa^ 

AAGAACCTAGTTGGGCGAGTTACGGACCTCTAAACCCGCACGGGGGCGTTCTGACGATCG 

- 8 7 CGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGTACTGCCTGATAGGGTGCT^^ 

GCTCATCACAACCCAGCGCTTTCCGGAACACCATGACGGACTATCCCACGAACGCTC^ 

-27 CCCGGGAGGTCTCGTAGACCGTGCACC -1 
GGGCCCTCCAGAGCATCTGGCACGTGG 



XetSerThrAsnProLysProGlnLysLysAsnLysArgAsnThrAsnArgArgProGln 
1 ATGAGCACGAATCCTAAACCTCAAAAAAAAAACAAACGTAACACCAACCGTCGCCCACAG 
TACTCGTCCTTAGGATTTGGAGTTTTTTTTTTGTTTGCATTGTGGTTGG 

AspValLysPheProGlyGlyGlyGlnlleValGlyGlyValTyrLeuLeuProArgArg 
6 1 GACGTCAAGTTCCCGGGTGGCGGTCAGATCGTTGGTGGAGTTTACTTGTTGCCGCGCAGG 
CTGCAGTTCAAGGGCCCACCGCCAGTCTAGCAACCACCTCAAATGAACAACGGCGCGTCC 

GlyProArgLeuGlyValArgAlaThrArgLysThrSerGluArgSerGlnProArgGly 
121 GGCCCTAGATTGGGTGTGCGCGCGACGAGAAAGACTTCCGAGCGGTCGCAACCTCGAGGT 
CCGGGATCTAACCCACACGCGCGCTGCTCTTTCTGAAGGCTCGCCAGCGTTGGAGCTCCA 

ArgArgGlnProIleProLysAlaArgArgProGluGlyArgThrTrpAlaGlnProGly 
181 AGACGTCAGCCTATCCCCAAGGCTCGTCGGCCCGAGGGCAGGACCTGGGCTCAGCCCGGG 
TCTCCAGTCGGATAGGGGTTCCGAGCAGCCGGGCTCCCGTCCTGGACCCGAGTCGGGCCC 

TyrProTrpProLeuTyrGlyAsnGluGlyCysGlyTrpAlaGlyTrpLeuLeuSerPro 
141 TACCCTTGGCCCCTCTATGGCAATGAGGGCTGCGGGTGGGCGGGATGGCTCCTGTCTCCC 
ATGGGAACCGGGGAGATACCGTTACTCCCGACGCCCACCCGCCCTACCGAGGACAGAGGG 

ArgGlySerArgProSerTrpGlyProThrAspProArgArgArgSerArgAsnLeuGiy 
301 CGTGGCTCTCGGCCTAGCTGGGGCCCCACAGACCCCCGGCGTAGGTCGCGCAATTTGGGT 
GCACCGAGAGCCGGATCGACCCCGGGGTGTCTGGGGGCCGCATCCAGCGCGTTAAACCCA 

LysVallleAspThrLeuThrCysGlyPheAlaAspLeuMetGlyTyrlleProLeuVal 
361 AAGGTCATCGATACCCTTACGTGCGGCTTCGCCGACCTCATGGGGTACATACCGCTCGTC 
TTCCAGTAGCXATGGGAATGCACGCCGAAGCGGCTGGAGTACCCCATGTATX^C^^ 

GlyAlaProLeuGlyGlyAlaAlaArgAlaLeuAlaHisGlyValArgValLeuGluAsp 
421 GGCGCCCCTCTTGGAGGCGCTGCCAGGGCCCTGGCGCATGGCGTCCGGGTTCTGGAAGAC 
CCGCGGGGAGAACCTCCGCGACGGTCCCGGGACCGCGTACCGCAGGCCCAAGACCTTCTG 

GlyValAsnXyrAlaThrGlyAsnLeuPreXalyCysSerPheSerllePheLeuLeuAla 
481 GGCGTG AACTATGCAACAGGGAACCTTCCTGGTTGGTCTTTCTCTATCTTCC^ 

CCGCACTTGATACGTTGTCCCTTGGAAGGACCAACGAGAAAGAGATAGAAGGAAGACCGG 
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LeuLeuSerCysLeuThrValProAlaSerAlaTyrXjlnValArcAsnSerThrGlyLeu 
541 CTGCTCTCTTCCTTGACrGTGCCCGCTTCGGCCTACCAAGTGCGO^C^ 

GACGAGAGAACGAACTGACACGGGCGAAGCCGGATGGTTCACGCGTTGAGGTGCCCCGAA 

TYrHisValThrAsnAspCysProAsnSerSerlleVaiTTrGluAlaAlaAspAlalle 
601 TACCACGTCACCAATGATTGCCCTAACTCGAGTATTGTGTACGAGGCGGCCGATGCCATC 
ATCGTGCAGTGGTTACTAACGGGATTGAGCTCATAACACATGCTCCGCCGGCTACGGT^^ 

LeuHisThrProGlyCysValProCysValArgGluGlyAsnAlaSerArgCysTrpVai 
661 CTGCACACTCCGGGGTGCGTCCCTTGCGTTCGTGAGGGCAACGCCTCGAGGTGTTGGGT 
GACGTGTGAGGCCCCACGCAGGGAACGCAAGCACTCCCGTTGCGGAGCTCCAO^ 

AlaMetThrProThrValAlaThrArgAspGlyLysLe'uProAlaThrGlnLeuArgArg 
721 GCGATGACCCCTACGGTGGCCACCAGGGATGGCAAACTCCCCGCGACGCAGCTTCGACGT 
CGCTACTGGGGATGCCACCGGTGGTCCCTACCGTTTGAGGGGCGCTGCGTCGAAGCTGCA 

HisIleAspLeuLeuValGlySerAlaThrLeuCysSerAlaLeuTyrValGlyAspLeu 
T8 1 CACATCGATCTGCTTGTCGGGAGCGCCACCCTCTGTTCGGCCCTCTACGTGGGGGACCTA 
GTGTAGCTAGACGAACAGCCCTCGCGGTGGGAGACAAGCCGGGAGATGCACCCCCTGGAT 

CysGlySerValPheLeuValGlyGlnLeuPheThrPheSerProAraArgHisTrpThr 
541 TGCGGGTCTGTCTTTGTTGTCGGCCAACTGTTCACCTTCTCTCCCAGGCGCCACTGGACG 
ACGCCCAGACAGAAAGAACAGCCGGTTGACAAGTGGAAGAGAGGGTCCGCGGTGACCTGC 

ThrGlnGlyCysAsnCysSerlleTyrProGlyHisIleThrGlyHisArgMetAlaTrp 
901 ACGCAAGGTTGCAATTGCTCTATCTATCCCGGCCATATAACGGGTCACCGCATGGCATGG 
TGCGTTCCAACGTTAACGAGATAGATAGGGCCGGTATATTGCCCAGTGGCGTACCGTACC 

AspMet^tMetAsnTrpSerProThrThrAlaLeuValMetAlaGliiLeulAuArgri^ 
961 GATATGATGATGAACTGGTCCCCTACGACGGCGTTGGTAATGGCTC AGCTGCTCCGGATC 
CTATACTACTA<OTGACCAGGGGATGCnXK:CGCAACCATTACCGAGTCGACGAGGCCT^^ 

ProGlnAlalleLeuAspMetlleAlaGlyAlaHisTrpGlyValLeuAlaGlylleAla 
1021 CCACAAGCCATCTTGGACATGATCGCTGGTGCTCACTGGGGAGTCCTGGCGGGCATA^ 

GGTGTTCGGTAGAACCTGTACTAGCGACCACGAGTGACCCCTCAGGACCGCCCGTATCGC 

TyrPheSerMetValGlyAsnTrpAlaLysValLeuValValLeuLeuLeuPheAlaGly 
1 : 3 1 TATTTCTCCATGGTGGGGAACTGGGCGAAGGTCCTGGTAGTGCTGCTGCTATTTGCCGGC 
ATAAAGAGGTACCACCCCTTGACCCGCTTCCAGGACCATCACGACGACGATAAACGGCCG 

ValAspAlaGluThrHisValThrGlyGlySerAlaGlyHisThrValSerGlyPheVal 
114 1 GTCGACGCGGAAACCCACGTCACCGGGGGAAGTGCCGGCCACACTGTGTCTGGATTTGCT 
CAGCTGCGCCTTTGGGTGCAGTGGCCCCCTTCACGGCCGGTG^^ 

SerLeuLeuAlaProGlyAiaLysGlnAsnValGlnLeuIleAsnThrAsnGlySerTrp 
1201 AGCCTCCTCGCACCAGGCGCCAAGCAGAACGTCCAGCTGAXCAACACCAACGGCAGTTGG 
TCGGAGGAGCGTGGTCCGCGGTTCGTCTTGCAGGTCGACTAGTTGTGGTTGCCGTC^ 

HisLeuAsnSerThrAlaLeuAsnCysAsnAspSerLeuAsnThrGlyTrpLeuAlaGly 
1261 CACCTCAATAGCACGGCCCTGAACTGCAATGATAGCCTCAACACCGGCTGGTTGGCAGGG 
GTGGAGTTATCGTGCCGGGACTTGACGTTACTATCGGAGTTGTGGCCGACCAACCGTCCC 

LeuPheTyrHisHisLysPheAsnSerSerGlyCysProGluArgLeuAlaSerCysArg 
1321 CTTTTCTATCACCACAAGTTCAACTCTTCAGGCTGTCCTGAGAGGCTAGCCAGCTGCCGA 
GAAAAGATAGTGGTGTTCAAGTTGAGAAGTCCGACAGGACTCTCCGATCGGTCGACGGCT 
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ProLeuThrAspPheAspGlnGlyTrpGlyProlleSerTyrAlcAsnGlySerGlyPro 
^ 381 ccccttaccgattttcaccagggctggggccctatcafittatgccj^cggaagcggcrc^ 
ggggaatx;gctaaaactggtcccgaccccgggatagtcaatacgc;ttgcct^ 

AspGlnArgProTyrCysTrpHisTyrProProLysProC7sGl»''IleValPzroAlaLys 
1441 GACCAGCGCCCCTACTGCTGGCACTACCCCCCAAAACCTTGCGG'JATTC^^ 

CTGGTCGCGGGGATGACGACCGTGATGGGGGGTTTTGGAACGCCATAACACGGGCGC 

SerValCysGlyProValTyrCysPheThrProSerProValVaiValGlyThrThrAsp 
L50 1 AGTGTGTGTGGTCCGGTATATTGCTTCACTCCCAGCCCCGTGGTGGTGGGAAC^^ 
TCACACACACCAGGCCATATAACGAAGTGAGGGTCGGGGCACCACCACCCl^ 

ArgSerGlyAlaProThrTyrSerTrpGlyGluAsnAspThrAspvalPheValLeuAsn 
1561 AGGTCGGGCGCGCCCACCTACAGCTGGGGTGAAAATGATACGGACGTCTTCGTCCTTAAC 
TCCAGCCCGCGCGGGTGGATCTCGACCCCACTTTTACTATGCCTGCAGAA^^ 

AsnThrArgProProLeuGlyAsnTrpPheGlyCysThrTrpMetAsnSerThrGlyPhe 
1621 AATACCAGGCCACCGCTCGGCAATTGGTTCGGITGTACCTGG ATGAACTCAAC^ 
TTATGGTCCGGTGGCGACCCGTTAACCAAGCCAACATGGACCTACTTGAGTTC 

ThrLysValCysGlyAlaProProCysVallleGlyGlyAlaGlyAsnAsnThrLeuHis 
.681 ACCAAAGTGTGCGGAGCGCCTCCTTGTGTCATCGGAGGGGCGGGCAACAACACCCTGCAC 
rGGTTTCACACGCCTCGCGGAGGAACACAGTAGCCTCCCCGCCCGTTGTTGTGGGACGTG 

CysProThrAspCysPheArgLysHisProAspAlaThrTyrSerArgCysGlySerGly 
1741 TGCCCCACTGATTGCTTCCGCAAGCATCCGGACGCCACATACTCTCGGTGCGGCTCCGGT 
ACGGGGTGACTAACGAAGGCGTTCGTAGGCCTGCGGTGTATGAGAGCCACGCCGAGGCCA 

ProTrpIleThrProArgCysLeuValAspTyrProTyrArgLeuTrpHisTyrProCys 
1801 CCCTGGATCACACCCAGGTGCCTGGTCGACTACCCGTATAGGCTTTGGCATTATCCTTC 
GGGACCTAGTGTGGGTCCACGGACCAGCTGATGGGCATATCCGAAACCGTAATAGGAACA 

ThrlleAsnTyrThrllePheLysIleArgMetTyrValGlyGlyValGluHisArgLeu 
1361 ACCATCAACTACACCATATTTAAAATCAGGATGTACGTGGGAGGGGTCGAACACAGGCTG 
TGGTAGTTGATGTGGTATAAATTTTAGTCCTACATGCACCCTCCCCAGCTTGTGTCCGAC 

GluAlaAlaCysAsnTrpThrArgGlyGluArgCysAspLeuGluAspArgAspArgSer 
1 ? 2 1 G AAGCTGCCTGCAACTGGACGCGGGGCGAACGTTGCGATCTGG AAG ACAGGGACAGGTCC 
CTTCGACGGACGTTGACCTGCGCCCCGCTTGCAACGCTAGACCTTCTGTCCCTGTCCAGG 

GluLeuSerProLeuLeuI^uThrThrThrGlnTrpGlnValLeuProCysSerPheThr 
1981 GAGCTCAGCCCGTTACTGCTGACCACTACACAGTGGCAGGTCCTCCCGTGTTCCTTCACA 
CTCGAGTCGGGCAATGACGACTGGTGATGTGTCACCGTCCAGGAGGGCACAAGGAAGTGT 

Thrl^uProAlaLeuSerThrGlyLeuIleHisLeuHisGlnAsnlleValAspValGln 
204 1 ACCCTACCAGCCTTGTCCACCGGCCTCATCCACCTCCACCAGAACATTG 

TGGGATGGTCGGAACAGGTGGCCGGAGTAGGTGGAGGTGGTCTTGTAACACCTGC^^ 

TyrLeuTyrGlyValGlySerSerlleAlaSerTrpAlalleLysTrpGluTyrValVal 
2101 TACTTGTACGGGGTGGGGTCAAGCATCGCGTCCTGGGCCATTAAGTGGGAGTACGTCGTT 
ATGAACATGCCCCACCCCAGTTCGTAGCGCAGGACCCGGTAATTCACCCTCATGCAGCAA 

LeuI^uPheLeuLeiiLeuAlaAspAlaArgValCysSerCysLeuTrpMetMetLeuLeu 
2161 CTCCTGTTCCTTCTGCTTGCAGACGCGCGCGTCTGGTCCTGCTTGTGGATGATC 

GAGGACAAGGAAGACGAACGTCTGCGCGCGCAGACGAGGACGAACACCTACTACGATGAG 
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1 ...I 

TAIAOWTTCGCCTCCGCCGAAACCTCTTGGAGCATTATGAATTACGTCGT^ 

CCCTGCGTGCCAGAACATAGGAA«5AaaCAA<a^CGAAA^ 

^^S^°JS^^SSSiSS'^^^'^<^C^™:«5GATGTGaCTCTC^ 
TTCACCCACGGGCCTCGCCMATCTGGAAGATGCCCTACACCGGAGAGGAGGACG^ 

^G'SESiSSlS^^^l^c^SgSSSJSi^ 

AACCGCAACGGGGTCGCCCGCATGCGCGACCTCTGCCTCCACcS^ 



:4oi 



2461 




.•^CCACGAACACCACCGAAGTCAXAAAAGACTGGTCTCACCTTCGCGTTGACGTGCACACC 
T»AaSG0C5GMIT«am:iCCCCCCOK<:T§S^ 

GTGGGCTGAGACCATAAACTGTAGTGGTTTAACGACGACCGGCAGAAGCCTGGGGAAACC 

TAAGAAGTTCGGTCAAACGAATTTCATGGGATGAAACACGCGCAGGTTCCGGAAGAGGCC 
?^'§SJ^^I^G^'S^^?n!i^.^^^^^^^ 




^p^ 1 5;S^iy*i?Jf:iJJliSiy^^^'^V*^^rAsnHisI^uThrProLeuArgA 
"'^^S^TTACTGGCACCTATGTTTATAACCATCT 

AATCCCaSCGAATGACCGTGGATACAAATATTGGTAGAGTCAGGAGAAGOICTCACC^ 

TGGTTCGAGTAGTGCACCCCCCGTCTATGGCGGCGCACGCCACTCTAGTAGTTGCCGAAC 

GGACAAAGGCGGGCGTCCCCGGCCCTCTATGACGAGCCCGGTCGGCTACCTTACC^ 
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LysGlyTrpArgljeuLeuAlaProIleThrAlaTyrAlaGlnGLiThxArgGlyLeuLeu 
-^061 AAGGGGTGGAGGTTGCTGGCGCCCATCACGGCGTACGCCCAGCAtSACAAGGGGCC^ 
TTCCCCACCTCCAACGACCGCGGGXMTGCCGCATGCGGGTCGTCTGTTC 

GlyCysIlelleThrSerLeuThrGlyArgAspLysAsnGlnValGluGlyGluValGln 
3121 GGGTGCATAATCACCAGCCTAACTGGCCGGGACAAAAACCAAGTGGA^ 
CCCACGTATTAGTGGTCGGATTGACCGGCCCTGTTTTTGGCT 

IleValSerThrAlaAlaGlnThrPheLeuAlaThrCysIleAsnGlyValCysTrpThr 
3181 ATTGTGTCAACTGCTGCCCAAACCTTCCIXKXZAACGTGCATCAAT^^ 
TAACACAGTTGACGACGGGTTTGGAAGGACCGTTGCACGTAGm 

ValTyrHisGlyAlaGlyThrArgXhrlleAlaSerProLysGlyProVairieGlnMet 
3241 GTCTACCACGGGGCCGGAACGAGGACCATCGCGTCACCCAAGGGTCCTGTCATCCA^ 
CAGATGGTGCCCCGGCCTTGCTCCTGGTAGCGCAGTGGGTTCCCAGGACAGTAGG 

TyrThrAsnValAspGlnAspLeuValGlyTrpProAlaProGlnGlySerArgSerLeu 
3301 TATACCAATGTAGACCAAGACCTTGTGGGCTGGCCCGCTCCGCAAGGTA^^ 
ATATCGTTACATCTGGTTCTGGAACACCCGACCGGGCGAGGCGTTCCATC^ 

ThrProCysThrCysGlySerSerAspLeuTyrLeuValThrArgHisAlaAspVallle 
361 ACACCCTGCACTTGCGGCTCCTCGGACCTTTACCTGGTCACGAGGCACGCCGATGTCATT 
TGTGGGACGTGAACGCCGAGGAGCCTGGAAATGGACCAGTGCXCCGTGCGGCTACAGTAA 

ProValArgArgArgGlyAspSerArgGIySerLeuLeuSerProArgProIleSerTyr 
3421 CCCGTGCGCCGGCGGGGTG ATAGCAGGGGC AGCCTGCTGTCGCCCCGGCCCATTTCCTAC 
GGGCACGCGGCCGCCCCACTATCGTCCCCGTCGGACGACAGCGGGGCCGGGTAAAGGATG 

LeuLysGlySerSerGlyGlyProLeuLeuCysProAlaGlyHisAlaValGlyllePhe 

3481 ttgaaaggctcctcggggggtccgctgttgtgccccgcggggcacgccgtxk;gcatatot 
aactttccgaggagccccccaggcgacaacacggggcgccccgtgcggcacccgtataaa 

ArgAlaAlaValCysThrArgGlyValAlaLysAlaValAspPhelleProValGluAsn 
3541 AGGGCCGCGGTGTGCACCCGTGGAGTGGCTAAGGCGGTGGACTTTATCCCTGTGGAGAAC 
TCCCGGCGCCACACGTGGGCACCTCACCGATTCCGCCACCTGAAATAGGGACACCTCTTG 

LeuGluThrThrMetArgSerProValPheThrAspAsnSerSerProProValValPro 
5601 CTAG AG ACAACCATGAGGTCCCCGGTGTTCACGGATAACTCCTCTCCACCAGTAGTGCCC 
GATCTCTGTTGGTACTCCAGGGGCCACAAGTGCCTATTGAGGAGAGGTGGTCATCACGGG 

GlnSerPheGlnValAlaHisLeuHisAlaProThrGlySerGlyLysSerXhrLysVal 
3661 CAGAGCTTCCAGGTGGCTCACCTCCATGCTCCCACAGGCAGCGGCAAAAGCACCAAGGTC 
GTCTCGAAGGTCCACCGAGTGGAGGTACGAGGGTGTCCGTCGCCGTTTTCGTGGTTC 

. ProAlaAlaTyrAlaAlaGlnGlyTyrLysValLeuValLeuAsnProSerValAlaAla 
3721 CCGGCTGCATATGCAGCTCAGGGCTATAAGGTGCTAGTACTCAACCCCTCTGTTGCTC 
GGCCGACGTATACGTCGAGTCCCGATATTCCACGATCATGAGTTGGGGAGACAACGA^^ 

ThrLeuGlyPheGlyAlaTyrMetSerLysAlaHisGlylleAspProAsnlleArgThr 
3781 ACACTGGGCTTTGGTGCTTACATGTCCAAGGCTCATGGGATCGATCCTAACATC^ 

TGTGACCCGAAACCACGAATGTACAGGTTCCGAGTACCCTAGCTAGGATTGTAGTCCTGG 

GlyValArgThrlleThrThrGlySerProIleThrryrSerThrTyrGlyLysPheLeu 
3841 GGGGTG AGAACAATTACCACTGGCAGCCCCATCACGTACTCCACCTACGGCAAGTTCCTT 
CCCCACTCTTGTTAATCGTGACCGTCGGGGTAGTGCATGAGGTGGATGCCGTTCAAGG^ 
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CC3GCTGCCaXCaCGAGCCCCCC«XAATACrGTATTATTAAACACTXKr^ 
' at 1 rjjf ^P?i2''*'SerIleLeuGlyileGlyThrValLeuAapGlnAlaGluThr AlaGly 
TGCCTACGGTGTAGGTMAACCCGTAGCCGIGACAGGAACTGGTTCGTC^ 

;taggg 



CGCTCTGACCAACACGAGCGGTGGCGGTGGGGAGGCCCGAGGCAGTGACACGGGG' 



4081 ffiJi^S^lS??SSgSigc^ 

TTGTAGCTCCTCCAAa3«a«aGGIGGlGGCCTCTCTAGddA^ 

ggggagcttcattagttccccccctctgtagagtagaagSaa^^ 

rfSa^)i^i^?^'^5^"^*^^"<^^yIl«^snAlaValAlaryrTyrArgGly 

T»CT«KC<aT»T0GCC0CI0»«0CI0»KC«SiS5S^ 
ISIC*GCt»»MTCGC«ACI«sareG»MTOW»CICTOT 

PheVa 

.501 tttgtcgcaccggg^ 



ogagggcgtgagttgcagccccgtcctgaccgtcccc^^ 

AAACACCGTGGCCCCCTCGCGGGGAGGCCGxSaS^ 

CGCATGTACTTGTGGGGCCCCGAAGGGCACACGGTCCTGGTAGAACTTAAAACCCTCCCG 

CAGAAATGTCCGGAGTCyUSTATATCTACGGGTGAAAGATAGGGTCTGTTTCGTCT^ 
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GliiAsnLeiiProTyrl^uValAlaTyrGlnAlaThrValCysALa^rgAlaGliiAlaP^^ 
4741 GAGAACCTTCCTTACCTGGTAGCGTACCAAGCCACCGTGTGCGCTAGGGC^^ 
CTCTIXKSAWKyUVTGGACCATCGCATGGTTCGGTGGCAC^ 

ProProSerTrpAspGlnMetTrpLysCysLeuIleArgLeuLysl'roThrLeuHi 
4801 CCCCCATCGTGGGACCAGATGTGGAAGTGTTrcATTCGCCTCAAGCCC^^ 

GGGGGTAGCACCCTGGTCTACACCTTCACAiACTAW^CGGAGTTCGGGl^^ 

ProThrProLeuLeuTyrArgLeuGlyAlaValGlnAsnGluIleThrLeuThrHisPro 
4861 CCAACACCCCTGCTATACAGACTGGGCGCTGTTCAGAATGAAATCACCCTGACGCA^ 
GGTTGTGGGGACGATATGTCTGACCCGCGACa^TCTTACTTT 

ValThrLysTyrlleMetThrCysMetSerAlaAspLeuGluValValThrSerThrTrp 
4921 GTCACCAAATACATCATGACATGCATGTCGGCCGACCTGGAGGTCGTCACGA^ 
CAGTGGTTTATGTAGTACTCTACGTACAGCCGGCTGGACCTCCAGCAGTGCT^ 

Vall^uValGlyGlyValLeuAlaAlaLeiiAlaAlaTyrCysl^uSerThrGlyCysVal 
4981 GTGCTCGTTGGCGGCGTCCTGGCTGCTTTGGCCGCGTATTGCCTC 

CACGAGCAACCGCCGCAGGACCGACGAAACCGGCGCATAACGGACAGTTGTCCGACGCAC 

VallleValGlyArgValValLeuSerGlyLysProAlallelleProAspArgGluVal 
5041 GTCATAGTGGGCAGGGTCGTCTTGTCCGGGAAGCCGGCAATCATACCTGACAGGGAAGTC 
CAGTATCACCCGTCCCAGCAGAACAGGCCCTTCGGCCGTTAGTATGGACTGTCCCTTC^ 

LeuTyrArgGluPheAspGluMetGluGluCysSerGlnHisLetiProTyrlleGluGln 
5 10 1 CTCTACCGAGAGTTCGATGAGATGGAAGAGTGCTCTCAGCACTTACCGTACATCGAGCAA 
GAGATGGCTCTCAAGCTACTCTACCTTCTCACGAGAGTCGXGAATGGCATGTAGCTCGTT 

GlyMetMetLeuAlaGluGlnPheLysGlnLysAlaLeuGlyLeuLeuGlnThrAlaSer 
5161 GGGATGATGCTCGCCGAGCAGTTCAAGCAGAAGGCCCnXIGGCCTCCT^^ 

CCCTACTACGAGCGGCTCGTCAAGTTCGTCTTCCGGGAGCCGGAGGACGT^^ 

ArgGlnAlaGluVallleAlaProAlaValGlnThrAsnTrpGlnLysLeuGluThrPhe 
5221 CGTCAGGCAGAGGTTATCGCCCCTGCTGTCCAGACCAACTGGCAAAAACTCGAGACCTTC 
GCAGTCCGTCTCCAATAGCGGGGACGACAGGTCTGGTTGACCGTTTTTC 

rrpAlaLysHisMetTrpAsnPhelleSerGlylleGlnTyrLeuAlaGlyLeuSerThr 
3:81 rCGGCG AAGCATATGTGGAACTTCATCAGTGGGATACAATACTTGGCGGGCTTGTCAACG 
ACCCGCTTCGTATACACCTTGAAGTAGTCACCCTATGTTATGAACCGCCCGAACAGTTGC 

LeuProGlyAsnProAlalleAlaSerLeuMetAlaPheThrAlaAlaValThrSerPro 

5341 CTGCCTGGTAACCCCGCCATTGCTTCATTGATGGCTTTTACAGCTGC^ 

GACGGACCATTGGGGCGGTAACGAAGTAAClACCGAAAATGTCGACGAaVGTGGTCGGGT 

LeuThrThrSerGlnThrLeuLeuPheAsnlleLeuGlyGlyTrpValAlaAlaGlnLeu 
5401 CTAACCACTAGCCAAACCCTCCTCTTCAACATATTGGGGGGGTGGGTGGCTGCCC^^ 
GATTGGTGAimSTTTGGGAGGAGAAGTTGTATAACCCCCCCACCCACCGAC^ 

AlaAlaProGlyAlaAlaThrAlaPheValGlyAlaGlyLeuAlaGlyAlaAlalleGly 
5461 GCCGCCCCCGGTGCCGCTACTGCCTTTGTGGGCGCTGGCTTAGCTGGCGCCGCCATC 

CGGCGGGGGCCACGGCGATGACGGAAACACCCGCGACCGAATCGACCGCGGCGGTAGCCG 

SerValGlyLeuGlyLysValLeulleAsplleLeuAlaGlyTyrGlyAlaGlyValAla 
5521 AGTGXTGGACTGGGGAAGGTCCTCATAGACATCCTTGCAGGGTATGGCGCGGGCGT^ 
TCAO^CCTGACCCCTTCCAGGAGTATCTGTAGGAACGTCCCATACCGCGCCCGCACCGC 
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CGGAAGa;GA(3GGCCCCCTTGGTAauUM;GGO;TGCGT(^ 



- ao 1 ^iS?i?i52Y*^^AlaIleI*uSerSerLeuThrValThrGlnI*uI«uAro^ 



6061 _ _ 

CACCTGCCGTAGTACGTGTGMCGACGGTGACACCTOTACTCTA^^ 

ATGCACTGCCMTACTGATCACTGTTAGAGTTTACGGGCACGGTCCAGGGTAGCGGGCTT 

S361 

aaaaagtgtcttaacctgcccc:acgcggatgtatccaaaccx:gg^ 
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7081 

AspAlaGluSerTvrSerSe'rM*»*-DT»rtD*.^r-../-i..^,.->„ • . 

7141 



CTGCGACTCAGGATAAGGiWSGTACGGGGGGGACCTCCC^^^ 
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SerMetSerTyrSerTrpThrGlyAlaLeuValThrProCysAlaAlaGluGluGlnLys 
7261 TCAATGTCTTACTCTTGGACAGGCGCACTCGTCACCCCG^ 

AGTTACAGAATGAGAACCnCTCCGCGTGAGCAGTGGGGOlCGCGGCGCC^ 

LeuProIleAsnAlaLeuSerAsnSerLeuLeuArgHisHisAsoLeuValTyrSerThr 
"321 CTGCCCATCAATGCACTAAGCAACTCGTTCCTACGTCACCACAATTTG^ 
GACGGGTAGTTACGTGATTCGTrKyw;CAACGATGCMTGGTC 

ThrSerArgSerAlaCysGlnArgGlnLysLysVaiThrPheAspArgLeuGlnValLeu 
7381 ACCTCACGCAGTGCTTCCCAAAGGCAGAAGAAAGTCACATTTGACAGACl^^ 
TCGAGTGCGTCACGAACGGTTTCCGTCTTCTTTCAGTG^ 

AspSerHisTyrGlnAspValLeuLysGluValLysAlaAlaAlaSerLysValLysAla 
7441 GACAGCCATTACCAGGACGTACTCAAGGA6GTTAAAGCAGCGGCGTCAAAAGTC 
CTGTCGGTAATGGTCCTGCATGAGTTCCTCCAATTTCGTCGCCGCAGTTT^^ 

AsnLeuLeuSerValGluGluAlaCysSerLeuThrProProHisSerAlaLysSerLys 
7501 AACTTGCTATCCGTAGAGGAAGCTTGCAGCCTGACGCCCCCACACTCAGCCAAATC 

TTGAACGATAGGCATCTCCTTCGAACGTCGGACTGCGGGGGTGTGAGTCGGTTTAGGTTC 

PheGlyTyrGlyAlaLysAspValArgCysHisAlaArgLysAlaValThrHisIleAsn 
• 5 6 1 TTTGGTTATGGGGCAAAAGACGTCCGTTGCCATGCCAGAAAGGCCGTAACCCAC ATCAAC 
AAACCAATACCCCGTTTTCTGCAGGCAACGGTACGGTCTTTCCGGCATTGGGTC 

SerValTrpLysAspLeul^uGluAspAsnValThrProIleAspThrThrlleMetAla 
7621 TCCGTGTGGAAAGACCTTCTGG AAGACAATGTAACACCAATAGACACTACCATCAT^ 
AGGCACACCTTTCTGGAAGACCTTCTCTTACATTGTGGTTATCTGTGATGGTAGT^^ 

LysAsnGluValPheCysValGlnProGluLysGlyGlyArgLysProAlaArgLeuIle 
7681 AAGAACGAGGTTTTCTGOTTTCAGCCTGMAAGGGGGGTCGTA^ 
ITCTTGCTCCAAAAGACGCAAGTCGGACTCTTCCCCCCAGCATTCG^ 

ValPheProAspI^uGlyValArgValCysGluLysMetAlaLeuTyrAspValValThr 
774 1 GTGTTCCCCGATCTGGGCGTGCGCGTGOXXIGAAAAGATGGCTTTGTACGACGTGG 
CACAAGGGGCTAGACCCGCACGCGCACACGCTTTTCTACCGAAACATGCTC 

LysLeuProLeuAlaValM.etGlySerSerTyrGlyPheGlnTyrSerProGlyGlnArg 
- 30 1 AAGCTCCCCTTGGCCGTCATGGGAAGCTCCTACGGATTCCAATACTCACCAGGA^ 
TTCGAGGGGAACCGGCACTACCCTTCGAGGAOXSCCTAAGGTTATGAGTGGTX:^^^ 

ValGluPheLeuValGlnAlaTrpLysSerLysLysThrProMetGlyPheSerTyrAsp 
7861 GTTGAATTCCTCGTGCAAGCGTGGAAGTCCAAGAAAACCCCAATGGGGTTCTCGTAT^ 
CAACTTAAGGAGCACGTTCGCACCTTCAGGTTCTTTTGGGGTT^ 

ThrArgCysPheAspSerThrValThrGluSerAspIleArgThrGluGluAlalleTyr 
7921 ACCCGCTGCTTTGACTCCACAGTCACTGAGAGCGACATCCGTACGGAGGAGGC^ 
TGGGCGACGAAACTGAGGTGTCAGTGACTCTCGCTGTAGGCATGCCTCCTCCGT^^ 

GInCysCysAspLeuAspProGlnAlaArgValAlalleLysSerLeuThrGluArgLeu 
7981 CAATGTTGTGACCTCGACCCCCAAGCCCGCGTGGCCATCAAGTCCCTCACCGAGAGGOT 
GTTACAACACTGGAGCTGGGGGTTCGGGCGOVCCGGTAGTTCAGGGAGTGGCTC^^ 

TyrValGlyGlyProLeuThrAsnSerArgGlyGluAsnCysGlyTyrArgArgCysArg 
8041 TATGTTGGGGGCCCTCTTACCAATTCAAGGGGGGAGAACTGCGGCTATCGCAGGTGCCGC 
ATACAACCCCCGGGAGAATGGTTAAGTTCCCCCCTCTTGACGCCGATAGCGTCCACGGCG 
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3101 
3161 
3221 
3281 
8341 
3401 
8461 
3521 
3581 
3641 
8701 
3761 
8821 



AlaSerGlyValLeuThrThrSerCysGlyAsnThrLeuThrCysTyrlleLysAIaArg 

GCGAGCGGCGTWZTGACAACTAGCTGTGGTAACACCCTCACTTGCTACATC^^ 

CGCTCGCCGCATCACTGTTGATCGACACCATTGTGGGAGTGAACGATGTAGTTC 

AlaAlaCysArgAlaAlaGlyLeuGlnAspCysThrMetLeuValCysGlyAspAspLeu 
GCAGCCTGTCGAGCCGCAGGGCTCCAGGACTGCACCATGCTCGTGTGTGGCGACGACTTA 
CGTCGGACAGCTCGGCGTCCCGAGGTCCTGACGTGGTACGAGCACACACCGCTGCTGAAT 

ValVallleCysGluSerAlaGlyValGlnGluAspAlaAlaSerLeuArgAlaPheThr 
GTCGTTATCTGTGAAAGCGCGGGGGTCCAGGAGGACGCGGCGAGCCTGAGAGCCTTCACG 
CAGCAATAGACACTTTCGCGCCCCCAGGTCCTCCTGCGCCGCTCGGACTCTCGGAAGTGC 

GluAlaMetThrArgTyrSerAlaProProGlyAspProProGlnProGluTyrAspLeu 
GAGGCTATGACCAGGTACTCCGCCCCCCCTGGGGACCCCCCACAACCAGAATACGACTTG 
CTCCGATACTGGTCCATGAGGCGGGGGGGACCCCTGGGGGGTGTTGGTCTTATGCTGAAC 

GluLeuIleThrSerCysSerSerAsnValSerValAlaHisAspGlyAlaGlyLysArg 
GAGCTCATAACATCATGCTCCTCCAACGTGTCAGTCGCCCACGACGGCGCTGGAAAGAGG 
CTCGAGTATTCTAGTACGAGGAGGTTGCACAGTCAGCGGGTGCTGCCGCGACCTTTCTC 

ValTyrTyrLeuThrArgAspProThrrhrProLeuAlaArcTAlaAlaTrpGluThrAla 
GTCTACTACCTCACCCGTGACCCTACAACCCCCCTCGCGAGAGCTGCGTGGGAGACAGCA 
CAGATGATGGAGTGGGCACTGGGATGTTGGGGGGAGCGCTCTCGACGCACCCTCTGTCGT 

ArgHisThrProValAsnSerTrpLeuGlyAsnllelleMetPheAlaProThrLeuTrp 
AGACACACTCCAGTCAATTCCTGGCTAGGCAACATAATCATGTTTGCCCCCACACTGTGG 
TCTGTGTGAGGTCAGTTAAGGACCGATCCGTTGTATTAGTACAAACGGGGGTGTGACACC 

AlaArgMetlleLeuMetThrHisPhePheSerValLeuIleAlaArgAspGlnLeuGlu 
GCGAGGATGATACTGATGACCCATTTCTTTAGCGTCCTTATAGCCAGGGACCAGCTTGAA 
CGCTCCTACTATGACTACTGGGTAAAGAAATCGCAGGAATATCGGTCCCTGGTCGAACTT 

GlnAlaLeuAspCysGluIleTyrGlyAlaCysTyrSerlleGluProLeuAspLeuPro 

CAGGCCCTCGATTCCGAGATCTACGGGGCCTGCTACTCCATAGAACCACTTGATCT^ 

GTCCGGGAGCTAACGCTCTAGATGCCCCGGACGATGAGGTATCTTGGTGAACTAGATGGA 

ProIlelleGlnArgLeuHisGlyLeuSerAlaPheSerLeuHisSerTyrSerProGlv 

CCAATCATTCAAAGACTCCATGGCCTCAGCGCATTTTCACTCCACAGTTACTCTC^ 

GGTTAGTAAGTTTCTGAGGTACCGGAGTCGCGTAAAAGTGAGGTGTCAATGAGAGGTCCA 

GluIleAsnArgValAlaAlaCysLeuArgLysLeuGlyValProProLeuArgAlaTrp 
GAAATTAATAGGGTGGCCGCATGCCTCAGAAAACTTGGGGTACCGCCCTTGCGAGCTTGG 
CTTTAATTATCCCACCGGCGTACGGAGTCTTTTGAACCCCATGGCGGGAACGCTCG^^ 

ArgHisArgAlaArgSerValArgAlaArgLeuLeuAlaArgGlyGlyArgAlaAlalle 
AGACACCGGGCCCGGAGCGTCCGCGCTAGGCTTCTGGCCAGAGGAGGCAGGGCTGCCATA 
TCTGTGGCCCGGGCCTCGCAGGCGCGATCCGAAGACCGGTCTCCTCCGTCCCGACGGTAT 

CysGlyLysTyrLeuPheAsnTrpAlaValArgThrLysLeuLys 

TGTGGCAAGTACCTCTTCAACTGGGCAGTAAGAACAAAGCTCAAAC 

ACACCGTTCATGGAGAAGTTGACCCGTCATTCITGTTTCGAGTTTG 
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"1-1519 

prloer J150S 

:i ACTgw?rqMCTrTr,^A'm^ 

PT G 

SarLeuLysmGlyP&eLeuAlaAlaLcufheTrrThrHlBtyBPhoABnAlaierGi 
I CTCCCTCAAAACTCGCTTTC7TCCCGCCC7<rrTCr:ACACACAC.UGrrCAACGCCTCCGG 30 

TAG C C C GCTT G A G T TCAC 7 T A 

^ Trp Gly Scr 

primer 166A for j1-1216 



yCycProCluArrietAlaSerCysArsfSerileASPLysPhcAspOXaGlyTrpGl^Pr 
.:l f>: ATCCCCGGAGCOCATGGCCAGCTGTCGCXCATTCACAACTTCGACCAGGGATGGGGTCC 
P T C T T A GC A C AC C AC G T T C C 

Leu ProLau ThrAsD 



CileTHrTyrAiaGlnProAspAsnSerAspGlnArffProTyrCycTrpHiiTyrAlaPr 
IZI CATCACCrATGCTCAACCTGACAACTCCCXCCACAGGCCCTATTGCTGCCACTACGCACC : 80 
T G7 -CA CGGAAG GGC: CCCC :c 

■ -Ser. A3r.*31y SerGly Pra p^'^ 



^AryCXnCysGlrIleVal?roAlaSc^GlnValCy6GlyP^oVsITyrCysPhcT^r?^ 
J i IS 1 TCCACAGTGTGGTATCCTACCCGCGTCGCACC7GTGCGGTCCA0T0TATTGCTTCACCCC 
PT AAA CT C T G M AGT T G A 



Z40 



oScrProValValVaiClyThrThrABpArffPheClyAlaPromrTrrAsnTrpGlyAs 
AAGCCCTGTTGTAGTGGCGACGACCGATCGTrrcCGCaCCCCTACGTATAACTCGGGCCA 
C CGO A CAGCC GCCCG T 

Ser Scr Cl 



300 



CAsaGlUTUrAspValLeuLeuLeuAanAsnThrArsProProHlcGiyAanTrDPheC: 
Jl :101 CAATGAGACCCACCTGCTGCTCCTAAACAACACCCGGCCCCCGCACGGCAACTGGTTCGG J60 
- AT CT CG T T CA A. TG 7 

u Aap PheVdl Leu 



yCysThr 
Jl 361 CTGTACA 387 

PT ? CTGGATGAACTCAArT^RATT 

primer 199A 



Nucieotlda Maceh: 259/36T (70.6XJ 

Amino Add MaccR Istrlnrent): 93/122 (75.2*) 
(ralAxad): lii/122 (51. OX) 

PrototTpe KCV (PT) act^ucnces dlfforenc from Japanese KCV (Jl; arc shown, 
ftelaxod aoino aeld matcli: Gl^«Aia»Pro-3cr-Thr. .\ap-Glu, Asn-Glr.. 
V3 te»-Lys=Hls. Leu-Zle-Val-Met. Pb«-Trp-Tyr. 
UfTderlin*. dlffaranr amino acid in relaxed matching. 
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FIGURE 14-1 

core to NSl vs. HC^l 



ProLeuVal 
TCCGCTCGTC 
A 

GlyAlaProLeuGlyGlyAlaAlaArgAlaLeuAlaHisGlyValArgValLeuGluAsp 
.1 GGCGCCCCCTT^GGGGCGCIGCCAGGGCCCTGGCACATGGTC 



.71 



Jl 



Jl 



131 



251 



311 



431 



jiZ)J2i^2Z?E^?TlJf5iZ^^"^^^^?^sSerPheSerIlePhe^ 

-CC-T~T- 



-TC 



LeuLeiiSerCysLeuThrlleProAlaSerAlaTyrGluValArgAsnValSerGlylle 




Pro 



Ala 



MetHisAlaProGlyCysValProCysValArgGluAsnAsnSerSerArgCysTrpVal 
ATGCATGCCCCCGGGTGCGTGCCCTGCGTTCGGGAGAACAATTCCTCCCGTTGC^^ 
C CA-T— G C~T T GG CG T- A 



Leu rhr 



Gly. Ala 




^^2^fi^^P^®^«"ValGlyThrAlaAlaPheCysSerAlaMetTyrValGlvAspLeu 
. uACGTCGACTTGCrCGTTGGGACGGCTGCTTTCTGCTCCGCTATGT^^ 



-A TC T~C GC~CA-CC T— CC-C 

Ser ThrLeu Leu 



CysGlySer ValPh eLeuIleSerGlnLeuPheThrPheSerProArqArgHisGluThr 
TGCGGATCTGTTTTCCTCATCTCCCAGCTGTTCACCTTCTCGCCTCGCCGGCATGAGACA 

G C — T — TG — GG A T — CA-G — C — CTG G 

ValGly Trp 



491 



351 



611 




AspMetMetMetAsnTrpSerProThrAlaAlaLeuValValSerGlnLeuLeuArglle 
GAXATCATCATGAACTGGTCGCCCACGGCAGCCTTAGTGGTGTCGCAGTTACTCCGGATC 

C~T A-G— G— G— AA— <3-T C-G 

Thr MetAla 




-CA— T 



IleLeu 



-A-C~T~T— T- 
Ile 



•A-A— <3 
lie 



671 



TyrTyrSerMetValGlyAsnTrpAlaLysValLeuIleValMetLeuLeuPheAlaGly 
TACTATTCCATGGTGGGGAACTGGGCTAAGGTTTTGATTGTGATGC^ 
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— T-TC G CC~G"A C G~A 

Phe Via Leu 

ValAspGlyHisThrArgValThrGlYGlyValGlnGLyHtsValThrSerThrLeuThr 
31 GTTGACGGGCATACCCGCGTGACGGGGGGGGTGCAAGGCCACGTCACCTCTACACTC^ 

~C C-G-A A — -C--C AAGTGCC- ACTGTG-—GGAT-TGTT 

AlaGLu His SerAla ThrVai GlyPheVal 

Serl^uPheArgProGlyAlaSerGlnLysIleGlnleuValAsnThrAsnGlySerTrp 
'91 TCCCTCTTTAGACCTGGGGCGTCCCAGAAAATTCAGCTTGTAAACACCAATGGCAGTT^ 

AG C-CGC A~C~CAAGC CG-C GA-C C 

LeuAla Lys AsnVal He 

HisIleAsoArgThrAlaLeuAsnCysAsnAspSerLeuGlnThrGlyPheLeuAlaAla 
851 CATATCAACAGGACTGCCCTGAACTGCAATGACTCCCTCCAAACTCGGTTCCT 

~CC T—C— G TAG A-C— C—C-GGT-G-'-A-G- 

Ser Asn Trp Gly 



911 CTGTTCTACACACACAAGTTCAACGCGTCCGGATGCCCGGAGCGCATGGCCAGCTC 

~T TCACC T-T~A— C~T— T A-GC-A C— A 

His Ser Leu 

SerIleAspLysPheAspGlnGlyTrt>GlyProIleThrTyrAlaGlnProAsDAsnSer 
971 TCCATTG ACAAGTTCGACCAGGGATGGGGTCCC ATC ACCTATGCTCAACCTGACAACTCG 

C~C~AC-G-T— T C C~T GT CAACGGAAGCGG-C-C 

ProLeuThrAsp Ser AsnGlySerGlyPro 

AspGlnArgProTyrCysTrpHisTyrAlaProArgGlnCysGlylleValProAlaSer 
1031 GACCAGAGGCCGTATTGCTGGCACTACGCACCTCGACAGTGTGGTATCGTACCCGCGTCG 

C-C— C~C C-C— AAA~CT— C T — G AA- 

Pro LysPro Lys 

GlnValCysGlyProValTyrCysPheThrProSerProValValValGlyThrThrAsp 
1091 CAGGTGTGCGGTCCAGTGTATTGCTTavCCCCAAGCCCTGTTGXAGTGGGGACGACK 

AGT T G — A T — C C~G — G A 

Ser 

ArgPheGlyAlaProThrTyrAsnTrpGlyAspAsnGluThrAspValLeuLeuLeuAsn 
115 1 CGTTTCGGCGCCCCTACGTATAACTGGGGGGACAATGAGACGGACGTGCTGCTCCTAAAC 

A-G-CG G~C— C— C-G T— A T CT-CG T 

Ser Ser Glu Asp PheVal 

AsnThrArgProProHisGlyAsnTrpPheGlyCysThr 
1211 AACACGCGGCCCCCGCACGGCAACTGGTTCGGCTGTACA 

~T~CA A TG T T 

Leu 
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TGG 

-X T C CTCA k AGT- 

Val 



ProProCysAsnlleGlyGlyValGlyAsnAsnlhrLeuThrCysProThrAspCysPhe 
61 GCCCCCCGTGTAACATCGGGGGGGTCCX;CMCAACACCTTGACCTGCCCCACGGACTGCT 

CG— T— T GT A CG C— CA T~T 

Val Ala His 

ArgLysThrProThrAlaThrTyrThrLysCysGlySerGlyProTrpLeuThrProArg 
121 TCCGGAAGACCCCGACGGCCACTTACACAAAATGTGGTTCGGGCCCTTGGTTGACA^ 

C CAT GAC A T-TCGG — C~C— •C~T~C A-C C- 

His Asp SerArg He 

CysLeuValAspTyrProTyrArgLeuTrpHisTyrProCysThrValAsnPheThrlle 
121 GGTGCTTGGTTGACTACCCATACAGGCTCTGGCACTACCCCTGCACTGTCAACTTTACCA 

r Q G — t T T — r — T — T — CA AC 

lie Tyr 

PheLysValArgMetTyrValGlyGlyValGluHis 
241 TCTTCAAGGTTAGGATGTATGTGGGGGGCGTGGAGCAC 

-A~T— AA-C C A~G— C~A 

lie 
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C200 ration saquenos vs. HCV-1 



.—CO AMjtetSertyaAlaHisGlyThrAspProAanlleArgThr 

3781 ACACTt^^CCTTrS^rJi^^ 

ThrLwiaiyPheGlyAlaTyr * He 



^ ^» A — A — T T— CAfi C CS 

Ser Gly 



* G X A TTTGTGACGAGTGCC\CTCC 

lleCysAspGluCysHisSer 



C200 Serllc 
AsniieGluGluVttlAlaLeuSerThrThtGlyCluIlBProPheTyrGlytysAla 



C200 4141 CCCMCGAGGCCAiSI^^ 

^ y2 ' ^ C 



aCV 1 A -GTC—AIH:— CA — C— C 

Val He 
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4261 CTTCATOTGTCCGTCATACCAACTAGC^ 

~— C— — — — c — <3— ' .1^— "J— ^ — — Q — «p — cere 



4321 AIXIACCGGCTATACCGGCGACTTCGACTCGGTGATAGACTC 

r-tetThrGlyTTTThrGlyAspPheAspSerVallleAspCysAsnThr^ 



4381 ACAGTCGATTTCAGCCTTCACCCTACCT^^ 

ThrValA6pPhftSerLeuA«pProThrPheThrIleGluThrIleThrLeuProGlnAsp 



4441 GCTGTCTCCCGCACTCAAO:^ 

AlaVaiSerArgThrGlnArgArgGlyArgThrGlyArgGlyLysProGIyllerTTArg 



4501 TTTOTGGCA 

PheValAlaProGlyGluArgProSer<;iyMetPheAspSerSerValLeuCysGluCys 

4561 ^TGACGCMGCTGTGOT 

TyrAspAlaGlyCyaAlaTrpTyrGlul^uThrProAlaGluThrThrValArgLeu^ 



4621 GCGTACATGAACACCCCGGGGCTTCCCGTGTCCCAGGAC^ 

AlaTyrMetAanThrProGlyLeuProValCysGlnAapHisLeuGluPhcTrpGluGly 



4681 GTCTTOAC^ 

ValPheThrGlylAuThTHlslleAapAlaHisPhaLa^erGlathrLyfiGlnSetGly 



4741 G AGAACCTTCCTTACCTGGTAGCGTACCAAGCCACCGTGTGCGCTAGGGC 

GluAaiil^uProTyrLeuValUaTyrGlnAlaThWalCysAlaArgAlaGlnAlaPro 



4801 CCCC»TCGTGa;ACCAGAlOTGGAAGTGTm 

ProProSerrrpAspGlnMetTrpLysCysLeuIleArgt^uLysProThPLeuHisGly 



4861 Caw^pCCCCTGCXAXJ!CAGACTGa 

ProThrProLeuLeuTyrArgLeuGlyAlaValGlnAanGluIleThrLeuThrHisPro 
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4921 <3XCI£CAAA3A<MCATGACATCCATCTCGGC^ 

VaiThrLysTy^Il^tetThrCysMetSerAlaAspLe^JGluvalVal^^lrSe^ 



4981 OTGCTCGTTGGCGGCGTCCTGGCTGCTTTGGCCGCGTATT^ 

VallAUValGlyGlyValLeuAlaAlaLeuAlaAlaTyrCysLeuSerThrGlyCysVal 



3IuVal 
** 0 9 4 '3^utGf C 

5041 GTCMAGTGGGCAGGGTCGTC^ 

VallleValGlyArgValValLftuSerGlyLysProAlallelleProAspArg 



5101 ^-CTACCGMMTTCGATGAGATGGAAGAGIGCGCCTCACACCTCCCClACATC^ 

T-TCAG r-A~G G~A 

SezGln 




, KTC-C G~C~GT- 

Ser 



^7-51 5:??$i?^*SiJ^*^^*PJ^oC^'»GluSer^ 
3221 AAGCAMCGGAGGCTGCTGC^^ 

CGT— G— A T-ATC— C— XGC— rCCAG-CCA— TGGCAA-AACTCGAGACCTTC 

Arg Vallls AlaValGlnThrAsnTrpGlnLysI/SuGluThrPhe 



5281 TGGGaSAAOMATGTGGAACTTCATCAGTGGGATACAATA 
TrpAlaLysHifiMetTrpAsnPhelleSexGlylleGln 
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MSI sequence vs. HCV-1 



-: -1 -c 



LeuGlyAsnTrpPheGlyCysThTTrpMetAsnSerSerGlyPhemLysVal^^ 
1 GTTGGGCAATTCKTTCGGTOra^CCTGGATGAACTCATC^^ 

~C ■ . M l « >, 



Thr 



^ , . AlaProProCysVailleGlyGlyValGlyAsnAsnThrLeuGlnCysProThrAsp^^ 

r.t„ , agcgcctccttgtgtcatcggaggggtgggcaacaacacctixk:a^ 

r.CV- 1 C ^ 

Ala His 
TTTCCG CAAGCATCO^GACGCCACAT ACTCTCGGT^ 

-•"A, , CAGGTGCCTGGTCCACTACCCTTATAGGCTTTGGCATTATC^ 

nCV-l G G 1 CA 

Asp He 

" , , ^LfJJJ2l5i*y^^*^9MetTyrValGlyGlyValGluHisArglAuGluVam 

.^^^ lie ^ 5r"^ 

TrpThrArgGlyGluArgCysAspLeuAspAspArgAsp 
.1 301 CTGGACGCGGGGCGAGCGTTGTGATCTGGACGACAGGGACA 

HCV-1 A C A 

Glu 
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•1 
v-1 



Core sequence vs. HC7-1 

GCGTCTATCCATGGCGTTAGTATGAGTC^ 



.^^^.^ G GGAGAGCCATAGTGGTCT GCGGAACCCXSTCaGTACACCGGA^ 
5c7-l TCCCTTCTTGGATCAACCCOrrCMTGCC^^ 

A 

^""^ TAGCCGAGTAGTGTTGGGTCGCGAAAGGCCTOTGGTACrGCCTC 

n 237 GTGCCCCGGGAGGTCTCGTAGACCGTGCATCSSJ^S^ 

.-fCV-l c g ^ — 

Lys 

'Q-r ,:)HHfS52^2?!5*^°^^*^5P'^'^l°AspValLysPheProGlyGl7GlyGlnIle 
, -"^CCAAACGTAACACCAACCGCCGCCCACAGGACGTCAAGTTCCCGGGCGGTGGT^ 

"Sr ^ 

.1^ 357 cSSl^?S^?G"S?G°?Sg?^^S^a^^^ 

rfCv-l x T— A G— 

1 ^^^ACTTCCGAGCGGTCCraVACCTCGTQGAiW^GCG^ 

HCY-i A A— T— A~T— G T-G 

Arg 

vi.-l ' ' ^^CCGAGGGCAGGGCCTG 

Thr ^ 

XetGlyTrpAlaGlyTrpLeu 
•1 537 CATGGGGTGGGCAGGATGGCTCCT 

-TGC G 

Cys 
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